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(57) Abstract: The immunoad- 
hesions of the present invention 
are useful in treating rhinovirus 
infections. The immunoadhesions 
contain a chimeric ICAM 

„ _ ^_ molecule and may optionally 

f 'SuperMAS* also contain J chain and secretory 

f / Promoter \ Xbal compounds. The chimeric ICAM 

/ pr M ia / ' ' molecule is a fusion protein that 

W oSSoHuA2 ' Bf^*"* nas a rhinovirus receptor protein 

J , } B^S^-^L B ean Legumin linked to an immunoglobulin 

i-aoi nai Signal Peptide protein. This invention also 

includes the greatly increased and 
improved method of producing 
immunoadhesions in plants. 
Each of the components of an 
immunoadhesin is produced 
in a plant cell and thereby 
assembles within the plant cell. 
This method of producing the 
immunoadhesions of the present 
invention results in the efficient 
and economic production of these molecules. The present invention also contemplates the production of immunoadhesions in a 
variety of eukaryotic cells including plants and mammalian cells. The immunoadhesions of the present invention are useful as a 
therapeutic a g ainst the com mon .cold J n.humnns-which.iscaused-bY-rhir.oviRises.- " ~ 
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NOVEL IMMUNOADHESIN FOR THE 
PREVENTION OF RHINOVIRUS INFECTION 

5 PRIORITY 

This application claims benefit under §1 19(e) of U.S. Provisional Patent Application 
No. 60/200,298, filed April 28, 2000, entitled NOVEL IMMUNOADHESIN FOR THE 
PREVENTION OF RHINOVIRUS INFECTION, and naming J. W. Larrick and K. L. Wycoff 
as inventors. This application is incorporated herein by reference in its entirety and for all 
10 purposes. 

STATEMENT REGARDING FEDERALLY SPONSORED 
RESEARCH OR DEVELOPMENT 

Federal research support was provided in the form of an SBIR Phase I grant (R43 
AI43122) and SBIR Phase H grant (2R44AI43 122-02). 

15 FIELD OF THE INVENTION 

The present invention relates to immunoadhesins, fusions of the human rhiibvirus 
receptor protein and immunoglobulin, and the expression of immunoadhesins in plants. The 
therapeutic use of immunoadhesins for the prevention and treatment of human rhinovirus 
infection is also contemplated. 

20 BACKGROUND TO THE INVENTION 

The common cold is generally a relatively mild disease, however significant 
complications resulting from colds, such as otitis media, sinusitis and asthma exacerbations 
are common. Human rhinoviruses (HRV) cause up to 50% of all adult colds and 25% of 
colds in children (Bella and Rossmann, J Struct Biol 128:69-74, 1999, and Sperber and 
25 Hayden, Antimicrob Agents Chemother. 32:409-19, 1988). The cost to society runs into 

billions of dollar per year. These small, nonenveloped RNA viruses represent a subgroup of 
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picomavirus (Rueckert, Virology, pp. 507-548, eds. Fields, et al, Raven Press, Ltd. New 
York, 1990) X-ray crystallography of rhinovirus identified a capsid 300 A in diameter (1 A = 
0.1 nm) with icosahedral symmetry, constructed from sixty copies each of the viral coat 
proteins VP1, VP2, and VP3 (Rossmann, Nature 317:145-153, 1985). A surface depression 

5 or "canyon" on HRV was suggested as the receptor binding site (Colonno, et al, Proc Natl 
AcadSci USA. 85:5449-5453, 1985; Rossmann, etal Nature 317:145-153, 1985). Of the 
102 characterized HRV serotypes, 91 (known as the major group) share as their receptor a cell 
surface glycoprotein known as intercellular adhesion molecule-1 (ICAM-1) (Greve, et al, 
Cell 56:839-847, 1989; Staunton, et al, Cell 56:849-853, 1989); the binding site is located 

10 within N-terminal domain 1 (Greve, et al, J Virol 65:6015-6023, 1991; Staunton, et al t Cell 
61:243-254,1990). 

ICAM-1 is a membrane protein with five extracellular domains, a hydrophobic 
transmembrane domain, and a short cytoplasmic domain. ICAM-1 is expressed on many cells 
important in immune and inflammatory responses, and is inducible on others (Casashovas, et 
15 al, Proc Natl AcadSci U S A. 95:4134-9, 1998). ICAM-1 functions as a ligand for the 

leukocyte integrins LFA-1 and Mac-1 (Springer, Cell 76:301-14, 1994; Staunton et at. ', Cell 
61:243-254, 1990). On the cell surface, ICAM-1 is primarily a dimer due to association of the 
transmembrane domains (Miller, et al, J Exp Med. 182:1231-41, 1995; Reilly, et alJ 
Immunol. 155:529-32, 1995). 

20 Recombinant, soluble forms of ICAM-1 (sICAM-1) consisting of the five extracellular 

domains were shown to be effective in blocking ihinovirus infection of human cells in vitro 
(Greve, et al, J Virol 65:6015-6023, 1991; Marlin, et al, Nature. 344:70-2, 1990). 
Evaluation of sICAM-1 activity against a spectrum of laboratory strains and field isolates 
showed that all major strains of HRV are sensitive to sICAM-1 . Minor strains, which do not 

25 use ICAM as a receptor, were unaffected by sICAM-1 (Crump et al , Antiviral Chem 

Chemother. 4:323-327, 1993; Ohlin, al,Antimicrob Agents Chemother. 38:1413-5, 1994). 
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The anti-viral activity of soluble ICAM-1 in vitro appears to be mediated by more than 
one mechanism. These mechanisms include competition with cell-surface ICAM-1 for 
binding sites, interference with virus entry or uncoating, and direct inactivation by premature 
release of viral RNA and formation of empty capsids (Arruda, et al, Antimicrob Agents 
5 Chemother. 36:1186-1191, 1992;Gx&\o 9 etal 9 JVirol 65:6015-6023, 1991; Marling a/., 
Nature 344:70-2, 1990; Martin et al, J Virol 67:3561-8, 1993). 

The host range of HRV is restricted to primates. A recent study showed that soluble . 
ICAM-1 was effective in preventing rhinovirus infection in chimpanzees (Huguenel, et al, 
Am JRespir Crit Care Med. 155:1206-10, 1997). Although chimpanzees do not show clinical 
10 symptoms, infection was demonstrated by measuring seroconversion and virus shedding. A 
single dose of 10 mg of soluble ICAM-1 as an intranasal spray was effective at preventing 
infection by HRV- 16 when co-administered with HRV, or when the virus was administered 
ten minutes later. 

A human clinical trial with soluble ICAM-1 showed that it reduced the severity of 
15 experimental HRV colds (Turner, et al, JAMA 281:1797-804, 1999). In this trial a total of * 
196 subjects received either soluble ICAM-1 or placebo in various formulations. Some 
subjects were given soluble ICAM-1 or placebo starting seven hours before inoculation "with 
HRV 39 and others were started twelve hours after virus inoculation. Medications were 
administered as either an intranasal solution or powder, given in six daily doses for seven 
20 days (a total of 4.4 mg per day). In this study, soluble ICAM-1 did not prevent infection, as 
measured by either virus isolation or seroconversion (infection rate of 92% for 
placebo-treated vs. 85% of soluble ICAM-1 treated). However, soluble ICAM-1 did have an 
impact on all measures of illness. The total symptom score was reduced by 45%, the 
proportion of subjects with clinical colds was reduced 23% and nasal mucus weight was 
25 reduced by 56%. There was not a significant difference between the use of powder or 

solution formulations, or between pre- and post-inoculation groups. Treatment with soluble 
ICAM-1 did not result in any adverse effects or evidence of absorption through the nasal 
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mucosa. Also, there was no inhibition of the development of anti-HRV type-specific 
antibodies. 

As discussed, ICAM-1 is dimeric on the cell surface. Martin et al, in J Virol 
67:3561-8, (1993) first proposed that multivalent binding to HRV by a multimeric soluble 
5 ICAM might result in a higher effective affinity, termed avidity, and thus facilitate uncoating 
of the virus. They constructed multivalent, ICAM-l/immunoglobulin molecules, postulating 
that these would be more effective than monovalent soluble ICAM-1 in neutralizing HRV and 
thus would have increased therapeutic utility. These ICAM-l/immunoglobulin molecules 
included ICAM-1 ammo-terminal domains 1 and 2 fused to the hinge and constant domains of 

10 the heavy chains of IgAl (ICl-2D/IgA), IgM (ICl-2D/IgM) and IgGl(ICl-2D/IgG). In 
addition, five extracellular domains were fused to IgAl (ICl-5D/IgA). These ICAM- 
l/immunoglobulin molecules were compared with soluble forms of ICAM-1 having two 
(sICl-2D) and five (sICl-5D) domains in assays of HRV binding, infectivity and 
conformation. The ICAM-l/IgA immunoglobulin (ICl-5D/IgA) was 200 times, and the 

15 ICAM-l/IgM immunoglobulin (ICl-2D/IgM) and ICAM-l/IgG immunoglobulin molecules 
(ICl-2D/IgG) were 25 and 10 times, more effective than soluble ICAM-1. These molecules 
were highly effective in inhibiting rhinovirus binding to cells and disrupting the conformation 
of the virus capsid. The ICAM-l/IgA immunoglobulin molecules were effective in the 
nanomolar concentration range. Comparison of ICl-2D/IgA and ICl-2D/IgG showed that the 

20 class of Ig constant region used had a large impact on efficacy. 

A subsequent study compared the inhibitory activities of soluble ICAM-1 and 
ICl-5D/IgA against nine major HRV serotypes and a variant of HRV-39 selected for 
moderate resistance to soluble ICAM-1 (Crump, et al, Antimicrob Agents Chemother. 
38:1425-7, 1993). ICl-5D/IgA was more potent than monomelic soluble ICAM-1 by 50 to 
25 143 times on a weight basis and by 60 to 170 times on a molar basis against the standard 
serotypes. The HRV-39 variant was 38-fold more resistant to soluble ICAM-1 than the 
wild-type, and it was only 5-fold more resistant to ICl-5D/IgA. This is consistent with the 
hypothesis that virus escape from inhibition by multivalent molecules would be expected to 
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occur at lower frequency than virus escape from inhibition by monomeric soluble receptor 
(Martin, et al, J Virol 67:3561-8, 1993). An assay designed to measure viral inactivation 
showed that HRV-39 and HRV-13 were not directly inactivated to a significant extent by 
soluble ICAM-1 (<0.5 logio reduction in infectivity). However, incubation with ICl-5D/IgA 
5 resulted in a reduction of infectivity of these same viruses by about 1 .0 logio (Crump, et al, 
Antimicrob Agents Chemother. 38: 1425-7, 1994). Results by Martin et al (J Virol 
67:3561-8, 1993) suggest that the greater the valence, the greater the effectiveness of the 
molecules. Dimeric and decameric forms of ICl-2/IgM were separable by sucrose gradient 
sedimentation. The decameric form was five times more effective than the dimeric form at 
10 blocking binding ofHRV to HeLa cells. 

The ICAM-l/immunoglobulin molecules that have been described suffer from several 
drawbacks, including the laborious production techniques and high costs associated with 
those production methods. In addition, the previously described ICAM-l/immunoglobulin 
molecules have limited stability as multimers in the harsh environment in which the molecule 
15 must inactivate rhinoviruses. 

The immunoadhesins of the present invention have significant advantages over what 
has been described in the art. The immunoadhesins of the present invention that are 
expressed in plants would be tetrameric, rather than only dimeric. Immunoadhesins having 
multiple binding sites have a higher effective affinity for the virus, thereby increasing the 

20 effectiveness of the immunoadhesin. In addition, the association of secretory component and 
immunoglobulin J chain with the immunoadhesin of the present invention increases the 
stability of the immunoadhesin in the mucosal environment (Corthesy, Biochem Soc Trans. 
25:471-475, 1997). Secretory IgA, which is associated with secretory component, is the 
antibody isotype normally found in mucosal secretions, including milk and colostrum. Unlike 

25 other antibody isotypes, SIgA can pass through the gut with very little proteolytic 

degradation. It also is very stable in crude plant preparations at room temperature. A 
function of the secretory component appears to be to protect the antibody from the harsh 
environment of the mucosa (Paul, Fundamental Immunology, Raven Press, NY, Third 
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Edition, pp. 303-304, 1993). Furthermore, the immunoadhesin of the present invention are 
significantly less expensive to produce in plants than in animal cell culture, and production i 
plants would make it safer for human use, since plants are not known to harbor any animal 
viruses. 

SUMMARY OF THE INVENTION 

The present invention contemplates an immunoadhesin comprising a chimeric 
ICAM-1 molecule having a rhinovirus receptor protein linked to at least a portion of an 
immunoglobulin heavy chain, wherein J chain and secretory component are associated with 
the chimeric ICAM-1 molecule. 

In preferred embodiments, the immunoadhesin of the present invention is comprised 
of a rhinovirus receptor protein made of any combination of extracellular domains 1, 2, 3, 4 
and 5 of the rhinovirus receptor protein, ICAM-1, linked to an immunoglobulin heavy chain. 
Also contemplated by the present invention are immunoadhesins of the present invention in 
which the immunoglobulin is IgA, IgA u IgA 2 ,IgGi, IgG 2 , IgG 3 , IgG 4 , IgM, IgD, IgE or * - 
chimeric immunoglobulin heavy chain made up of domains or segments from different 
immunoglobulin isotypes. . 

In other preferred embodiments of the present invention, the immunoadhesin 
comprises multiple chimeric ICAM- 1 molecules associated with J chain and secretory 
component. The increase in valency results in a higher effective affinity for the rhinovirus, 
thereby increasing the effectiveness of the immunoadhesin. 

In a preferred embodiment of the present invention, all proteins used to make the 
immunoadhesin of the present invention are human proteins. In addition to production in 
plants or plant cells, the present invention contemplates an immunoadhesin expressed in 
mammalian cells, hairy root cultures, plant cells in tissue culture, and heterologous cells 
derived from plants, vertebrates or invertebrates. 
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In preferred embodiments of the present invention, the immunoadhesins are expressed, 
in plants, including monocotyledonous plants and dicotyledonous plants as a part of the plants 
genome. Expression in plants, as opposed to expression in cultured cells, allows for a 
significant reduction in the cost of producing the immunoadhesin. 

5 The present invention contemplates an immunoadhesin having plant-specific 

glycosylation. A gene coding for a polypeptide having within its amino acid sequence, the 
glycosylation signal asparagine-X-serine/threonine, where X can be any amino acid residue, is 
glycosylated via oligosaccharides linked to the asparagine residue of the sequence when 
expressed in a plant cell. See Marshall, Ann. Rev. Biochem., 41 :673 (1972) and Marshall, 

10 Biochem. Soc. Symp., 40:17 (1974) for a general review of the polypeptide sequences that 
function as glycosylation signals. These signals are recognized in both mammalian and in 
plant cells. At the end of their maturation, proteins expressed in plants or plant cells have a 
different pattern of glycosylation than do proteins expressed in other types of cells, including 
mammalian cells and insect cells. Detailed studies characterizing plant-specific glycosylation . 

15 and comparing it with glycosylation in other cell types have been performed* for example, in ; 
studies described by Cabanes-Macheteau et al. y Glycobiology 9(4):365-372 (1999), and 
Altmann, Glycoconjugate J. 14:643-646 (1997). These groups and others have shown that 
plant-specific glycosylation generates glycans that have xylose linked P(l,2) to mannose, but 
xylose is not linked P(l ,2) to mannose as a result of glycosylation in mammalian and insect 

20 cells. Plant-specific glycosylation results in a fucose linked a(l ,3) to the proximal GlcNAc, 
while glycosylation in mammalian cells results in a fiicoselinked ct(l,6) to the proximal 
GlcNAc. Furthermore, plant-specific glycosylation does not result in the addition of a sialic 
acid to the terminus of the protein glycan, whereas in glycosylation in mammalian cells, sialic 
acid is added. 

25 In other embodiments, the immunoadhesin of the present invention is part of a 

composition comprising plant material and the immunoadhesin, associated with J chain and 
secretory component. The plant material present may be plant cell walls, plant organelles, 
plant cytoplasms, intact plant cells, viable plants, and the like. The particular plant materials 
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or plant macromolecules that may be present include ribulose bisphosphate carboxylase, light 
harvesting complex, pigments, secondary metabolites or chlorophyll. Compositions of the 
present invention may have an immunoadhesin concentration of between 0.001% and 99.9% 
mass excluding water. In other embodiments, the immunoadhesin is present in a 
5 concentration of 0.01% to 99% mass excluding water. In other embodiments, the 

compositions of the present invention have plant material or plant macromolecules present at 
a concentration of 0.01% to 99% mass excluding water. 

The present invention also contemplates methods for the treatment or prevention of 
human rhinovirus infection in a subject, including reducing the infection by human rhinovirus 
of host cells susceptible to infection by the virus, or reducing the initiation or spread of the 
common cold due to human rhinovirus, by a method comprising contacting the virus with an 
immunoadhesin of the present invention, wherein the immunoadhesin binds to the human 
rhinovirus and reduces infectivity. The immunoadhesin could mediate infection by 
competition with cell-surface IC AM- 1 for binding sites, interference with virus entry or 
uncoating, and/or direct inactivation by premature release Qf viral RNA and formation of 
empty capsids (Arruda, et al, Antimicrob. Agents Chemother. 36:1186-1191" 1992; Greve, ei 
al, J. Virol 65:6015-6023, 1991; Martin, et al, Nature 344:70-2, 1990; Martin, et al, J. 
Virol 67:3561-8, 1993). In another embodiment, human rhinovirus infection in a subject is 
treated by a method comprising intranasally administering to the subject an effective amount 
of an immunoadhesin of the present invention, wherein the immunoadhesin reduces human 
rhinovirus infectivity. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 illustrates pSSPHuA2, vector in which DNAs encoding a chimeric ICAM-1 
molecule containing the first five domains of human ICAM-1 and the Fc region of human 
25 IgA2m(2) were fused. This vector contains the SuperMas promoter for driving the expression 
of a signal peptide and the constant regions of the human IgA2m(2) heavy-chain. Sequences 
encoding ICAM domains 1-5 were amplified, by PCR, to contain convenient restriction sites 
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(5' Spel and 3' Spel) for insertion between the signal peptide and the Cal domain. DNA 
encoding an ER retention signal (RSEKDEL) was appended to the 3* end of the heavy-chain 
in order to boost the expression level of the construct. 

BIG. 2 illustrates a chimeric ICAM molecule. 2A shows the DNA expression cassette 
from which the chimeric ICAM-1 molecule was produced. 2B shows the amino acid C 
sequence, after signal peptide cleavange, of the mature form of the fusion protein. Amino - 
acids introduced by the cloning procedure are underlined and mark the junction between the 
five extracellular domains of ICAM-1 and the Cal~Ccc3 domains of the IgA2m(2) heavy : , 
chain. The bolded N's indicate the fifteen potential glycosylation sites. - 

flG. 3 illustrates the expression of the immunoadhesin in independently transformed 
tobacco calli. 3A shows immunobldts of non-reducing SDS-polyacrylamide gels on which 
samples containing different transformed tobacco calli (C) and aqueous extracts (Aq) wore 
run and probed for the presence of human ICAM. The molecular weight markers are . • 
indicated, and the reference standard (R) was a mixture (-75 ng each) of human ICAM (-75 
kD) and human SigA (»250 kD). 3B shows immunoblots of nonreducing 
SDS-polyacrylamide gels containing Various fractions of partially purified immunoadhesin 
from callus Rhil 07-11. The purification fractions analyzed were juice (J), G-100 fraction , 
(G), sterile filtered G-100 fraction (SG), and a mixture of reference standards of human SigA - 
(75 ng) and human ICAM-1 (75ng) (RS). 

Blots were probed with antibodies against human ICAM (-ICAM), human IgA heavy 
chain (-a), human secretory component (~SC) and human J chain (~J). Secondary, 
enzyme-conjugated antibodies were-employed as necessary to label immuno-positive bands 
with alkaline phosphatase. 

FIG. 4 illustrates the results of an enzyme-linked immunosorbent assay (ELISA) 

showing competition between plant extract and soluble ICAM-1 for binding to an anti-ICAM 

mAb; For the assay, 96-well plates were coated with 0.25 jig soluble ICAM-l/ml. The 

squares represent the increasing concentrations of sICAM and the circles represent the 
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increasing amounts of callus extract (sterile filtered fraction from G-100) used to compete 
with the adhered ICAM for a constant amount of a mouse (anti-human ICAM) antibody. 

FIG. 5 illustrates the results of an assay showing the ability of an immunoadhesin to 
inhibit human rhinovirus killing of HeLa cells (cytopathic effect, or CPE, assay). 5 A shows 
the results of an assay comparing the CPE of human rhinovirus on HeLa cells in the presence 
of partially purified extracts containing either the immunoadhesin in the ICAM-Fc fusion 
(ICl-5D/IgA) or containing an antibody against doxorubicin. (The right side-up and 
upside-down triangles represent two extracts derived from Rhil07-1 1, containing the 
immunoadhesin.) SB shows the results of an assay comparing the CPE of human rhinovirus 
on HeLa cells in the presence of soluble human ICAM-1 or an extract from the 
immunoadhesin in the ICAM-Fc fusion (ICl-5D/IgA). The Inset shows scale expansion in 
the range of the IC50 for soluble ICAM (1.35 fig/ml) and for ICl-5D/IgA (0.12 |ig/ml; 11.3 
fold-less). 

FIG. 6 shows an evaluation of the production necessities for making 1 gram of 
finished immunoadhesin. In this diagram, the number of plants needed for 1 g of 
immunoadhesin, at 20% yield, at expected levels of expression and plant weight is illustrated. 
At different levels of immunoadhesin expression (mg/kg fresh weight) and overall recovery 
(set at 20%), the weight of each plant, and so the total number of plants, may be determined 
for a specified production target (1 g/harvest) within a window (dotted square) of reasonable 
possibilities. The number of required plants decreases, inversely, with the number of 
specified growth and re-growth periods. The expected biomass production, a function of time 
arid growth conditions, influences the time to harvest and the time between harvests. These 
growth periods can be adjusted to the realities of the purification schedule by staggering 
planting and harvesting dates. 

FIG. 7 shows fiie coding and amino acid sequences of each of the immunoglobulin 
genes and proteins listed in Table 2. 
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FIG, 8 shows the sequences of plasmids used to transform plants, as described in 
Example 2, for use in studies of the expression of immunoadhesins of the present invention. 

FIG. 8 A shows the nucleotide and protein sequences for plasmids PSSpICAMHuA2 

FIG 8 B shows the nucleotide protein sequences for the bean legumin signal peptide. 

FIG 8 C shows the nucleotide and amino acid sequence of the protein coding region 
ofpSHuJ. 

FIG 8 D shows the nucleotide and amino acid sequence of protein coding region of 
pSHuSC. 

FIG 8 E shows the nucleotide sequence of plasmids pBMSP- 1 . 

FIG 8 F shows the nucleotide sequence of plasmids pBMSP-lspJSC. 

FIG. 9 contains nucleotide and protein sequences SEQ ID NO: 1; SEQ ID NO: 2; 
SEQ ID NO: 3; SEQ ID NO: 4; SEQ ED NO: 5; SEQ ID NO: 6; SEQ ID NO: 7; SEQ ID NO: 
8, for ICAM-1 , and human IgA 2 and other nucleotide sequences. 

DETAILED DESCRIPTION OF THE INVENTION r : "' 2 ' 
A. Definitions. 

As used herein, the following abbreviations and terms include, but are not necessarily 
limited to, the following definitions. 

The practice of the present invention will employ, unless otherwise indicated, 
conventional techniques of immunology, molecular biology, microbiology, cell biology and 
recombinant DNA, which are within the skill of the art. See, e.g., Sambrook, et al t 
Molecular Cloning: A Laboratory Manual, 2nd edition (1989); Current Protocols In 
Molecular Biology (FM. Ausubel, etal eds. 9 (1987)); the series Methods In Enzymology 
(Academic Press, Inc.); M.J. MacPherson, et al, eds. Per 2: A Practical Approach (1995); 
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Harlow and Lane, eds, Antibodies: A Laboratory Manual (1988), and H. Jones, Methods In 
Molecular Biology vol. 49, ''Plant Gene Transfer And Expression Protocols" (1995). 

Immunoglobulin molecule or Antibody. A polypeptide or multimeric protein 
containing the immunologically active portions of an immunoglobulin heavy chain and 
5 immunoglobulin light chain covalently coupled together and capable of specifically 

combining with antigen. The immunoglobulins or antibody molecules are a large family of 
molecules that include several types of molecules such as IgD, IgG, IgA, secretory IgA 
(SIgA),IgM,andIgE. 

Construct or Vector. An artificially assembled DNA segment to be transferred into a 
10 target plant tissue or cell. Typically, the construct will include the gene or genes of a 

particular interest, a marker gene and appropriate control sequences. The term "plasmid" 
refers to an autonomous, self-replicating extrachromosomal DNA molecule. In a preferred • 
embodiment, the plasmid constructs of the present invention contain sequences coding for 
heavy and light chains of an antibody. Plasmid constructs containing suitable regulatory 
15 elements are also referred to as "expression cassettes." In ac preferred embodiment, a plasmid 
construct can also contain a screening or selectable marker, for example an antibiotic 
resistance gene. 

Selectable marker. A gene that encodes a product that allows the growth of transgenic 
tissue on a selective medium. Non-limiting examples of selectable markers include genes 
20 encoding for antibiotic resistance, e.g., ampicillin, kanamycin, or the like. Other selectable 
markers will be known to those of skill in the art. 

Transgenic plant. Genetically engineered plant or progeny of genetically engineered 
plants. The transgenic plant usually contains material from at least one unrelated organism, 
such as a virus, another plant or animal. 

25 Chimeric ICAM-1 molecule: The fusion of any combination of the extracellular 

domains 1, 2, 3, 4 and 5 of ICAM-1 with at least a part of an immunoglobulin heavy chain 
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protein, made by linking ICAM-1 sequence upstream of an immunoglobulin heavy chain gene 
sequence and expressing the encoded protein from the construct. 

Chimeric immunoglobulin heavy chain: An immunoglobulin derived heavy chain 
having at least a portion of its amino acid sequence derived from an immunoglobulin heavy 
5 chain of a different isotype or subtype or some other peptide, polypeptide or protein. 
Typically, a chimeric immunoglobulin heavy chain has its amino acid residue sequence 
derived from at least two different isotypes or subtypes of immunoglobulin heavy chain. 

Dicotyledonous plants (chcots): Flowering plants whose embryos have two seed 
halves or cotyledons. Examples of dicots are: tobacco; tomato; the legumes including alfalfa; 
10 oaks; maples; roses; mints; squashes; daisies; walnuts; cacti; violets and buttercups. 

Effective amount: An effective amount of an immunoadhesin of the present invention 
is sufficient to detectably inhibit rhinovirus infection, cytotoxicity or replication; or to reduce 
the severity or length of rhinovirus infection. 

Human rhinovirus (HRV): A nonenveloped RNA virus representing a subgroup of 
15 picornavirus, that is a major cause of the common cold in humans. Rhinoviruses are 
described in Rhinoviruses, Reoviruses, and Parvoviruses, pp. 1057-1059, Zinsser 
Microbiology, Joklik et aL 9 eds. Appleton and Lange (1992). 

Immunoadhesin : A complex containing a chimeric ICAM-1 molecule, and optionally 
containing secretory component, and J chain. 

20 Immunoglobulin heavy chain: A polypeptide that contains at least a portion of the 

antigen binding domain of an immunoglobulin and at least a portion of a variable region of an 
immunoglobulin heavy chain or at least a portion of a constant region of an immunoglobulin 
heavy chain. Thus, the immunoglobulin derived heavy chain has significant regions of amino 
acid sequence homology with a member of the immunoglobulin gene superfamily. For 

25 example, the heavy chain in an Fab fragment is an immunoglobulin-derived heavy chain. 
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Immunoglobulin light chain: A polypeptide that contains at least a portion of the 
antigen binding domain of an immunoglobulin and at least a portion of the variable region or 
at least a portion of a constant region of an immunoglobulin light chain. Thus, the 
immunoglobulin-4erived light chain has significant regions of amino acid homology with a 
5 member of the immunoglobulin gene superfamily. 

Immunoglobulin molecule: A protein containing the immunologically-active portions 
of an immunoglobulin heavy chain and immunoglobulin light chain covalently coupled 
together and capable of specifically combining with antigen. 

ICAM-1: Intercellular adhesion molecule- 1. La humans, ICAM-1 functions as the 
1 0 receptor for human rhinovirus. 

J chain: A polypeptide that is involved in the polymerization of immunoglobulins and 
transport of polymerized immunoglobulins through epithelial cells. See, The Immunoglobulin 
Helper: The J Chain in Immunoglobulin Genes, at pg. 345, Academic Press (1989). J chain is 
found in pentameric IgM and dimeric IgA andtypically attached via disulphide bonds; J 
1 5 chain has been studied in both mouse and human. 

Monocotyledonous plants (monocots): Flowering plants whose embryos have one 
cotyledon or seed leaf. Examples of monocots are: lilies; grasses; corn; grains, including oats, 
wheat and barley; orchids; irises; onions and palms. 

Glycosylation: The modification of a protein by oligosaccharides. See, Marshall, 
20 Ann. Rev. Biochem., 41 :673 (1972) and Marshall, Biochem. Soc. Symp., 40:17 (1974) for a 
general review of the polypeptide sequences that function as glycosylation signals. These 
signals are recognized in both mammalian and in plant cells. 

Plant-specific glycosylation: The glycosylation pattern found on plant-expressed 

proteins, which is different from that found in proteins made in mammalian or insect cells. 

25 Proteins expressed in plants or plant cells have a different pattern of glycosylation than do 

proteins expressed in other types of cells, including mammalian cells and insect cells. 
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Detailed studies characterizing plant-specific glycosylation and comparing it with 
glycosylation in other cell types have been performed by Cabanes-Macheteau et al. 9 
Glycobiology 9(4):365-372 (1999), Lerouge etal, Plant Molecular Biology 38:31-48 (1998) 
and Altmann, Glycoconjugate 1 14:643-646 (1997). Plant-specific glycosylation generates 
5 glycans that have xylose linked P(l,2) to mannose. Neither mammalian nor insect 

glycosylation generate xylose linked P(l,2) to mannose. Plants do not have a sialic acid 
linked to the terminus of the glycan, whereas mammalian cells do. In addition^ plant-specific 
glycosylation results in a fucose linked cc(l,3) to the proximal GlcNAc, while glycosylation in 
mammalian cells results in a fucose linked a(l,6) to the proximal GlcNAc. 

10 Secretory component (SC): A component of secretory immunoglobulins that helps to 

protect the immunoglobulin against inactivating agents thereby increasing the biological 
effectiveness of secretory immunoglobulin. The secretory component may be from any 
mammal or rodent including mouse or human. 

sICAM: A naturally-ocxurring solubletomcated form of ICAM-1 lacking both the. 
1 5 hydrophobic transmembrane domain and the carboxy-termindl cytoplasmic domain of ICAM. 

The articles, patents and patent applications cited in this document are incorporated 
into this document as if set forth in full. 

B. Immunoadhesins Containing Chimeric ICAM Molecules. 

The present invention provides novel methods for producing immunoadhesin 
20 molecules containing chimeric ICAM molecules. The immunoadhesins of the present 
invention contain chimeric ICAM-1 molecules made up of a rhino virus receptor protein 
linked to a portion of an immunoglobulin heavy chain molecule in association with J chain 
and secretory component The chimeric ICAM-1 molecules of the present invention contain 
two molecules derived from different sources: a rhinovirus receptor protein portion and an 
25 immunoglobulin chain portion. The rhinovirus receptor protein of the present invention is 
derived from the intercellular adhesion molecule 1 (ICAM-1). The nucleotide sequence for 
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the human rhinovirus receptor ICAM-1 has been determined and characterized by Staunton, et 
al., Cell 52:925-933 (1988); Greve, ef a/. Cell 56:839-847 (1989); Greve, etal.J. Virology 
65:6015-6023 (1991); Staunton, etal, Cell, 61:243-254 (1990) and described in Sequence ID 
No. 3 and GenBank accession no. M24283. 

The ICAM-1 molecule is a membrane protein (SEQ ID NOS: 1 and 2) that has 5 
extracellular domains, a hydrophobic transmembrane domain and a short cytoplasmic domain. 
These features have been described by Casasnovas, et al, Proc. Natl. Acad. Set U.S.A., 
95:4134-4139 (1998) and Staunton, et al, Cell 52:925-933 (1988). Of particular use in the 
present invention are the domains of the ICAM-1 molecule that are responsible for the 
binding of human rhinoviruses which have been localized to the N-terminal domains 1 and 2 
(Greve, et al, J. Virol., 65:6015-6023 1991, and Staunton, et al, Cell, 61 :243-245 1990. The 
present invention also contemplates rhinovirus receptor protein portions which include any 
combination of extracellular domains U 2, 3, 4, and 5 of the ICAM-1 molecule. In particular 
preferred embodiments, the rhinovirus receptor protein portion includes domains 1 and 2 of 
the ICAM-1 molecule and in other preferred embodiments domains 1, 2, 3 ; 4 and-5 of the . -.: 
ICAM-1 molecule are present. 

The boundaries of the 5 extracellular domains are well known in the art and described 
in Staunton, et al,, Cell 52:925-933 (1988). The approximated domain boundaries are shown 
in Table 1 below. 



Table 1 



ICAM-1 Domains 



Amino Acids 



1 



1-88 



2 



89-105 



3 



106-284 



4 



285-385 



5 



386-453 
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As used in the present invention, the ICAM-1 domain 1 is from about residue 1 to 
about residue 88; domain 2 is from about residue 89 to about residue 105; domain 3 is from 
about residue 106 to about residue 284; domain 4 is from about residue 285 to about 385; and 
domain 5 is from about residue 386 to 453. One of skill in the art will understand that the 
5 exact boundaries of these domains may vary. 

The chimeric ICAM-1 molecules of the present invention preferably contain at least a 
portion of an IgM or IgA heavy chain which allows that immunoglobulin heavy chain to bind 
to immunoglobulin J chain and thereby binds to the secretory component. It is contemplated 
that the portion of the chimeric ICAM-1 molecule derived from the immunoglobulin heavy 

10 chain of the present invention may be comprised of individual domains selected from the IgA 
heavy chain or the IgM heavy chain or from some other isotype of heavy chain. It is also 
contemplated that an immunoglobulin domain derived from an immunoglobulin heavy chain 
other than IgA or IgM or from an immunoglobulin light chain may be molecularly engineered 
to bind immunoglobulin J chain and thus may be used to produce immunoglobulins of the 

15 present invention. < 

One skilled in the art will understand that immunoglobulins consist of domains which 
are approximately 100-1 10 amino acid residues. These various domains are Ayell known in 
the art and have known boundaries. The removal of a single domain and its replacement with 
a domain of another antibody molecule is easily achieved with modern molecular biology. _ 

20 The domains are globular structures which are stabilized by intrachain disulfide bonds. This 
confers a discrete shape and makes the domains a self-contained unit that can be replaced or 
interchanged with other similarly shaped domains.. The heavy chain constant region domains 
of the immunoglobulins confer various properties known as antibody effector functions on a 
particular molecule containing that domain. Example effector functions include complement 

25 fixation, placental transfer, binding to staphyloccal protein, binding to streptococcal protein 
G, binding to mononuclear cells, neutrophils or mast cells and basophils. The association of 
particular domains and particular immunoglobulin isotypes with these effector functions is 
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well known and for example, described in Immunology, Roitt et aL, Mosby St. Louis, Mo. 
(1993 3rd Ed.) 

One of skill in the art will be able to identify immunoglobulin heavy chain constant 
region sequences. For example, a number of immunoglobulin DNA and protein sequences 
5 are available through GenBank. Table 2 shows the GenBank Accession numbers of 

immunoglobulin heavy chain genes and the proteins encoded by the genes. The sequences 
listed in Table 2 are shown in Fig. 7. 



Table 2 



GENBANK 
ACCESSION NO. 


HUMAN IMMUNOGLOBULIN SEQUENCE NAME 


SEQ ID NO. 


J00220 


Ig a i Heavy Chain Constant Region Coding Sequence 


15 


J00220 


Tt* * Keaw Chain Constant Region Amino Acid Seouence 


16 




TaA« TImvv Chain Constant Region Codine Seauence 


17 




TcrA* Chain Constant T?**aion Amino Acid Seauence 


18 


J00228 


Ig y i Heavy Chain Constant Region Coding Sequence 


19 


J00228 


Ig y i Heavy Chain Constant Region Amino Acid Sequence 


20 


J00230 
V00554 


IgG 2 Heavy Chain Constant Region Coding Sequence 




J00230 
V00554 


IgG 2 Heavy Chain Constant Region Amino Acid Sequence 


22 


X03604 
M12958 . 


IgG 3 Heavy Chain Constant Region Coding Sequence 


23 


X03604 
M12958 


IgG 3 Heavy Chain Constant Region Amino Acid Sequence 


24 


K01316 


IgG 4 Heavy Chain Constant Region Coding Sequence 


25 


K01316 


IgG 4 Heavy Chain Constant Region Amino Acid Sequence 


26 


K02876 


IgD Heavy Chain Constant Region Coding Sequence 


27 


K02876 


IgD Heavy Chain Constant Region Acid Sequence 


28 


K02877 


IgD Heavy Chain Constant Region Coding Sequence 


29 


K02877 


IgD Heavy Chain Constant Region Amino Acid Sequence 


30 
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GENBANK 
ACCESSION NO. 


HUMAN IMMUNOGLOBULIN SEQUENCE NAME 


SEQIDNO. 


K02878 


Germline IgD Heavy Chain Coding Sequence 


31 


K02878 


Germline IgD Heavy Chain Amino Acid Sequence 


32 


K02879 


Germline IgD Heavy Chain C-5-3 Domain Coding 
Sequence 


33 


K02879 


Gennline IgD Heavy Chain C-S-3 Amino Acid Sequence 


34 


K013U 


Gennline IgD Heavy Chain J-5 Region: C-5 CHI Amino 
Acid Sequence 


35 


K02880 


Germline IgD Heavy Chain Gene, C-Region, Secreted 
Terminus Coding Sequence 


36 


K02880 


Germline IgD Heavy Chain Gene, C-Region, Secreted 
Terminus Amino Acid Sequence 


37 


K02881 


Germline IgD-Heavy Chain Gene, C-Region, First 
Domain of Membrane Terminus Coding Sequence 


38 


K02881 


Gerrnline IgD-Heavy Chain Gene, C-Region, First 
Domain of Membrane Terminus Amino Acid Sequence 


39 


K02882 


Gennline IgD Heavy Chain Coding Sequence 


... - --40-- 


K02882 


Germline IgD Heavy Chain Amino Acid Sequence 


41 * 


K02875 


Germline IgD Heavy Chain Gene, C-Region, C-6-1 
Domain Coding Sequence 


42 


K02875 


Germline IgD Heavy Chain Gene, C-Region, C-5-1 
Domain Amino Acid Sequence 


43 


L00022 
J00227 
V00555 


IgE Heavy Chain Constant Region Coding Sequence 


44 


L00022 
J00227 
V00555 


IgE Heavy Chain Constant Region Amino Acid Sequence 


45 


X17115 


IgM Heavy Chain Complete Sequence Coding Sequence 


46 


X17115 


IgM Heavy Chain Complete Sequence mRNA 


47 
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The immunoadhesins of the present invention may, in addition to the chimeric 
ICAM-1 molecule, contain immunoglobulin light chains, or immunoglobulin J chain bound to 
the immunoglobulin derived heavy chains. In preferred embodiments, the immunoadhesin of 
the present invention comprises two or four chimeric ICAM-1 molecules and an 

5 immunoglobulin J chain bound to at least one of the chimeric ICAM-1 molecules. The J 
chain is described and known in the art. See, for example, M. Koshland, The 
Immunoglobulin Helper: The J Chain, in Immunoglobulin Genes, Academic Press, London, 
pg. 345, (1989) and Matsuuchi et al. y Proc. Natl Acad. ScL U.S.A., 83:456-460 (1986). The 
sequenceof the immunoglobulin J chain is available on various databases in the United 

10 States. 

The immunoadhesin of the present invention may have a secretory component 
associated with the chimeric ICAM-1 molecule. This association may occur by hydrogen 
bonds, disulfide bonds, covalent bonds, ionic interactions or combinations of these various 
bonds. Typically, chimeric ICAM-1 molecules are held together by disulfide bonds between 
15 the molecules. The interaction of the chimeric ICAM-1 molecules may be non-covalent or 
disulfide bonding. The present invention contemplates the use of secretory component from a 
number of different species, including human, rat, rabbit, bovine and the like. The nucleotide 
sequences for these molecules are well known in the art. For example, U.S. Patent 6,046,037 
contains many of the sequences and this patent is incorporated herein by reference. 

20 The immunoadhesins of the present invention containing the secretory component, the 

chimeric ICAM-1 molecule and J chain are typically bonded together by one of the following: 
hydrogen bonds, disulfide bonds, covalent bonds, ionic interactions or combinations of these 
bonds. 

The present invention also contemplates immunoadhesins which comprise more than 
25 one chimeric ICAM-1 molecule. The immunoadhesin may contain chimeric ICAM-1 
molecules that are monomeric units and not disulfide bonded to other chimeric ICAM-1 
molecules. In preferred embodiments, the immunoadhesion does contain chimeric ICAM-1 
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molecules that are in association with other chimeric ICAM-1 molecules to form dimers and 
other multivalent molecules. Typically the chimeric ICAM-1 molecule is present as a dimer 
because of the association of the immunoglobulin portion of the chimeric molecule. The 
immunoglobulin portion of the chimeric ICAM-1 molecule allows the association of two 
5 chimeric ICAM-1 molecules to form a dimeric molecule having two active binding portions 
made up of the rhinovirus receptor protein portion. In preferred embodiments, dimerization 
occurs via the disulfide bonding regions that normally occur between the immunoglobulin * 
domains as part of a natoally-occurring immunoglobulin molecule and the native 
immunoglobulin protein. One of skill in the art will understand that these disulfide bonds that 
10 are normally present in the native immunoglobulin molecule can be modified, moved and 
removed while still maintaining the ability to form a dimer of the chimeric ICAM-1 
molecules. 

In other preferred embodiments, the immunoadhesin contains multimeric forms of the 
chimeric ICAM-1 molecule due to the association of J chain with the immunoglobulin portion 

15 of the chimeric ICAM molecule. The association of J chain.with the dimer of two chimeric 
ICAM-1 molecules allows the formation of tetrameric forms of the immunoadhesin. in a 
preferred embodiment, the immunoglobulin portion of the chimeric ICAM-1 molecule is 
derived from the IgA molecule, and the addition of J chain allows the formation of a 
tetrameric complex containing four chimeric ICAM-1 molecules and four binding sites. In 

20 other preferred embodiments, the immunoglobulin heavy-chain portion of the chimeric 

molecule is derived from IgM and multivalent complexes containing ten or twelve molecules 
may be formed. In other preferred embodiments, in which the chimeric ICAM-1 molecule 
uses a chimeric immunoglobulin heavy-chain, the chimeric ICAM-1 molecule may form 
dimers or other higher order multivalent complexes through the domains from either IgA or 

25 IgM that are responsible for J chain binding. In other chimeric immunoglobulin molecules 

the portions of the immunoglobulin responsible for the disulfide bonding between the two 

immunoglobulin heavy-chains and/or the disulfide bonding between an immunoglobulin 

light-chain and heavy-chain maybe placed in the chimeric immunoglobulin molecule to allow 

the formation of dimers or other high order multivalent complexes. 
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The present invention contemplates immunoadhesins containing a chimeric ICAM-1 
molecule in which the immunoglobulin domains comprising the heavy chain are derived from 
different isotypes of either heavy or light chain immunoglobulins. One skilled in the art will 
understand that using molecular techniques, these domains can be substituted for a similar 
5 domain and thus produce an immunoglobulin that is a hybrid between two different 
immunoglobulin molecules. These chimeric immunoglobulins allow immunoadhesins 
containing secretory component to be constructed that contain a variety of different and . 
desirable properties that are conferred by different immunoglobulin domains. 

The present invention also contemplates chimeric ICAM-1 molecules in which the 
1 0 portion of the chimeric molecule derived from immunoglobulin, heavy or light J chain may 
contain less than an entire domain derived from a different immunoglobulin molecule. The 
same molecular techniques may be employed to produce such chimeric ICAM-1 molecules. 

In preferred embodiments, the chimeric ICAM-1 molecules of the present invention 
contain at least the C H 1, C H 2, C H 3, domain of mouse or human IgAi, IgA 2 or IgM. "Other 
1 5 preferred embodiments of the present invention contain immunoglobulin domains that include 
at least the Qil, Cji2, Qi3, or C|x4 domains of IgM. 

Preferred chimeric ICAM-1 molecules contain domains from two different isotypes of 
human immunoglobulin. Preferred chimeric ICAM-1 molecules that include 
immunoglobulins that contain immunoglobulin domains including at least the ChI, Ch2, or 

20 C H 3 of human IgG, IgGl, IgG2, IgG3, IgG4, IgAl, IgA2, IgE, or IgD. Other prefeired 
immunoglobulins for use as part of chimeric ICAM-1 molecules include immunoglobulins 
that contain domains from at least the ChI, C h 2, C h 3, or C H 4 domain of IgM or IgE. The 
present invention also contemplates chimeric ICAM-1 molecules that contain 
immunoglobulin domains derived from at least two different isotypes of mammalian 

25 immunoglobulins. Generally, any of the mammalian immunoglobulins can be used in the 
prefeired embodiments, such as the following isotypes: any isotype of IgG, any isotype of 
IgA, IgE, IgD or IgM. The present invention also contemplates chimeric ICAM-1 molecules 
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derived from a species such as human, mouse or other mammals. In preferred embodiments, 
the chimeric ICAM-1 molecule contains the portion of IgA or IgM responsible for the 
association of J chain with the IgA and IgM. Thus, by using a chimeric immunoglobulin in 
the chimeric ICAM-1 molecule, the J chain may associate with a chimeric immunoglobulin 
5 that is predominantly of an isotype that does not bind J chain or secretory component. 

The present invention also contemplates chimeric ICAM-1 molecules that contain 
immunoglobulin domains derived from two different isotypes of rodent or primate 
immunoglobulin. The isotypes of rodent or primate immunoglobulin are well known in the 
art. The chimeric ICAM-1 molecules of the present invention may contain immunoglobulin 

10 derived heavy chains that include at least one of the following immunoglobulin domains: the 
C H 1, C H 2, or C H 3 domains of a mouse IgG, IgGl, IgG2a, IgG2b, IgG3, IgA, IgE, or IgD; the 
C H 1 , C H 2, C H 3 or C H 4 domain of mouse IgE or IgM; the C H 1 domain of a human IgG, IgGl , 
IgG2, IgG3, IgG4, IgAl, IgA2, or IgD; the C H 1, C H 2, C H 3, C H 4 domain of human IgM or 
IgE; the C H 1, Ch2, or C H 3 domain of an isotype of mammalian IgG, an isotype of IgA, IgE, or 

15 IgD; the C H 1, C H 2, C H 3 or C H 4 domain of a mammalian IgE or IgM; the C H 1, C H 2 , or C H 3 
domain of an isotype of rodent IgG, IgA, IgE, or IgD; the C H 1» C H 2, C H 3 or C H 4 domain of a 
rodent IgE or IgM; the ChI, Ch2, or C H 3 domain of an isotype of animal IgG, an isotype of 
IgA, IgE, or IgP; and the C H 1, C H 2, C H 3, or C H 4 domain of an animal IgE or IgM. The 
present invention also contemplates the replacement or addition of protein domains derived 

20 from molecules that are members of the immunoglobulin superfamily into the chimeric 
ICAM-1 molecules. The molecules that belong to the immunoglobulin superfamily have 
amino acid residue sequence and nucleic acid sequence homology to immunoglobulins. The 
molecules that are part of the immunoglobulin superfamily can be identified by amino acid or 
nucleic acid sequence homology. See, for example, p. 361 of Immunoglobulin Genes, 

25 Academic Press (1989). 

In preferred embodiments of the present invention, the immunoadhesin is expressed 
by methods that generate an immunoadhesin having plant-specific glycosylation. It is * 
well-known in the art that glycosylation is a major modification of proteins in plant cells 
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(Lerouge et al, Plant Molecular Biology 38:31-48, 1998). Glycosylation of proteins also 
occurs in other cell types, including mammalian and insect cells. The glycosylation process 
starts in the endoplasmic reticulum by the co-translational transfer of a precursor 
oligosaccharide to specific residues of the nascent polypeptide chain. Processing of this 
oligosaccharide into different types of glycans, which differ in the types of residues present 
and the nature of the linkages between the residues, occurs in the secretory pathway as the 
glycoprotein moves from the endoplasmic reticulum to its final destination. One of skill in 
the art will understand that at the end of their maturation, proteins expressed in plants or plant 
cells have a different pattern of glycosylation than do proteins expressed in other types of 
cells, including mammalian cells and insect cells. Detailed studies characterizing 
plant-specific glycosylation and comparing it with glycosylation in other cell types have been 
performed, for example, in studies described by Cabanes-Macheteau et al, Glycobiology 
9(4):365-372 (1999), and Altmann, Glycoconjugate J. 14:643-646 (1997). These groups and 
others have shown that plant-specific glycosylation generates glycans that have xylose linked 
(3(1,2) to mannose, but xylose is not linked 0(1,2) to mannose as a result of glycosylation in 
mammalian and insect cells. Plant-specific glycosylation results in a fiicose linked cc(l,3) to - 
the proximal GlcNAc, while glycosylation in mammalian cells results in a fiicose linked 
cc(l ,6) to the proximal GlcNAc. Furthermore, plant-specific glycosylation does not result in 
the addition of a sialic acid to the terminus of the protein glycan, whereas in glycosylation in 
mammalian cells, sialic acid is added. 

The immunoadhesin of the present invention that is glycosylated in a plant-specific 
manner can contain a chimeric ICAM-1 molecule that includes any combination of 
extracellular domains 1, 2, 3, 4, and 5 of the ICAM-1 molecule. FIG. 2B shows the amino 
acid sequence of the chimeric ICAM-l/IgA2 molecule (SEQ ED NO: 8) of the present 
invention, that contains all five domains of ICAM-1. The bolded N's represent asparagine 
residues to which oligosaccharide moieties are linked during glycosylation in plant cells, as 
well as mammalian and insect cells. One of skill in the art will know that the glycosylation 
sites are the tripeptide Asn-X-Ser/Thr where X can be any amino acid except proline and 



24 



WO 01/83529 PCT/US01/13932 

aspartic acid (Kornfeld and Kornfeld, Annu Rev Biochem 54:631-664, 1985). It will therefore 
be known to one of skill in the art that which amino acids of the protein having plant-specific 
glycosylation would depend on which domains of ICAM-1 are present. Because the sequence 
and domain boundaries of ICAM-1 are known (see Staunton et al. 9 Cell 52:925-933, 1988), it 
5 would be evident to one of skill in the art how to determine the plant-specific glycosylation 
sites on any potential combination of any of the five ICAM-1 domains. 

In other preferred embodiments of the present invention, the immunoadhesin having 
plant-specific glycosylation and containing a chimeric ICAM-1 molecule having any 
combination of ICAM-1 extracellular domains 1, 2, 3, 4 and 5 further comprises a J chain and 
10 secretory component associated with said chimeric ICAM-1 molecule. As was true with 
respect to the chimeric ICAM- 1 molecule, one of skill in the art will be able to identify the , 
sites for plant-specific glycosylation in the J chain and secretory component sequences. 

The present invention contemplates immunoadhesins having plant-specific 
glycosylation, that contain a chimeric ICAM-1 molecule in which the immunoglobulin heavy 

15 chain is selected from the group ofIgA(SEQ ID NOS: 15-18), IgAi(SEQ ID NOS: 15-16), 
IgA 2 (SEQEDNO: 17), IgGi (SEQ ID NOS: 19-20), IgG 2 (SEQ ID NOS: 21-22), IgG 3 (SEQ 
ID NOS: 23-24), IgG 4 (SEQ ID NOS: 25-26), IgM (SEQ ID NOS: 46-47), IgD (SEQ ID 
NO: 27-32 and 36-41-43), IgE (SEQ ID NOS: 44-45), and a chimeric immunoglobulin 
heavy chain. One of skill in the art will know that which of these immunoglobulin heavy 

20 chain sequences, or which combination of immunoglobulin heavy chain sequences are 

combined in a chimeric immunoglobulin heavy chain, will have an effect on the number and 
location of glycosylation sites in the chimeric ICAM-1 molecule of the immunoadhesin. As 
was true with respect to the chimeric ICAM-1 molecule, one of skill in the art will be able to 
identify the sites for plant-specific glycosylation in the immunoglobulin heavy chain 

25 sequences, including the various chimeric immunoglobulin heavy chain sequences that can be 
constructed. 
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Also provided herein are immunoadhesin functional derivatives. By "functional 
derivative" is meant a "chemical derivative," "fragment," or 'Variant," of the polypeptide or 
nucleic acid of the invention which retains at least a portion of the function of the protein, for 
example reactivity with an antibody specific for the protein, enzymatic activity or binding 
activity, which permits its utility in accordance with the present invention. It is well known in 
the art that due to the degeneracy of the genetic code numerous different nucleic acid 
sequences can code for the same amino acid sequence. It is also well known in the art that 
conservative changes in amino acid can be made to arrive at a protein or polypeptide that, 
retains the functionality of the original. In both cases, all permutations are intended to be 
covered by this disclosure. 

The derivatives may also be engineered according to routine methods to include an 
affinity purification tag such that large quantities and/or relatively pure or isolated quantities 
of immunoadhesin may be produced. Many different versions of tag exist that can be 
incorporated into one or more components of the immunoadhesin, preferably not destroying 
the desired binding activity of the immunoadhesin in the absence of tag; Such tags can be- 
engineered as expressible encoded nucleic acid sequence fused with nucleic acidsequences 
encoding the immunoadhesins of the invention. The tags may further be engineered to be 
removable, e.g., with a commercially available enzyme. 

Further, it is possible to delete codons or to substitute one or more codons with codons' 
other than degenerate codons to produce a structurally modified polypeptide, but one which 
has substantially the same utility activity as the polypeptide produced by the unmodified 
nucleic acid molecule. As recognized in the art, the two polypeptides can be functionally 
equivalent, as are the two nucleic acid molecules that give rise to their production, even 
though the differences between the nucleic acid molecules are not related to the degeneracy of 
the genetic code. 

Manipulations of this sort, and post-production chemical derivatization may be 
implemented, to improve stability, solubility, absorption, biological or therapeutic effect, 
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and/or biological half-life. Moieties capable of mediating such effects are disclosed, for 
example, in Remington's Pharmaceutical Sciences, 18th ed., Mack Publishing Co., Easton, 
PA (1990). A functional derivative intended to be within the scope of the present invention is 
a "variant* * polypeptide which either lacks one or more amino acids or contains additional or 

5 substituted amino acids relative to the native polypeptide. The variant may be derived from a 
naturally occurring complex component by appropriately modifying the protein DNA coding 
sequence to add, remove, and/or to modify codons for one or more amino acids at one or more 
sites of the C-terminus, N-terminus, and/or within the native sequence. It is understood that 
such variants having added, substituted and/or additional amino acids retain one or more 

10 characterizing portions of the native protein, as described above. 

A functional derivative of a protein with deleted, inserted and/or substituted amino 
acid residues may be prepared using standard techniques well-known to those of ordinary skill 
in the art. For example, the modified components of the functional derivatives may be 
produced using site-directed mutagenesis techniques (as exemplified by Adehnan et al, 

15 1983, DNA 2:183) wherein nucleotides in the DNA coding sequence are modified such that a 
modified coding sequence is produced, and thereafter expressing this recombinant DNA in a 
prokaryotic or eukaryotic host cell, using techniques such as those described above. 
Alternatively, proteins with amino acid deletions, insertions and/or substitutions may be 
conveniently prepared by direct chemical synthesis, using methods well-known in the art. 

20 The functional derivatives of the proteins typically exhibit the same qualitative biological 
activity as the native proteins. 

In addition, the immunoadhesins of the invention may be not just modified ICAM-l/Ig 
immunoadhesins, but may also embrace other native ICAM family members, isotypes, and/or 
other homologous amino acid sequences, e.g. f human, primate, rodent, canine, feline, bovine, 
25 avian, etc. Furthermore, the Ig type used in the immunoadhesins can vary, eg., may assume a 
different Ig family member identity, within or without a given species. ICAMs and Igs are 
diverse and have well-known sequences that one of ordinary skill can exploit to create 
different immunoadhesins having more or less different utility in a given organism to undergo 
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treatment. An illustrative, nonexhaustive list of examples of molecules having ICAM-1 
homology that can be used to create other immunoadhesins include those in the following 
table. 



Table 3 



ACCESSION NO. 


IC AM NAME 


SPECIES 


NP 000192 


Intercellular Adhesion Molecule-1 (CD54) 


Homo sapiens 


AAH03097 


Intercellular Adhesion Molecule ICAM-2 


Homo sapiens 


NP 002153 


Intercellular Adhesion Molecule 3 Precursor 


Homo sapiens 


BAB20325 


TCAM-1 


Homo sapiens 


NP 003250 


Intercellular Adhesion Molecule 5 (Telencephalin) 


Homo sapiens 


NM 007164 


Mucosal Vascular Address in Cell Adhesion Molecule 
(MADCAM1) 


Homo sapiens 


NM 001078 


Vascular Cell Adhesion Molecule 1 (VCAM1) 


Homo sapiens ' 


AAA37875 


MALA-2 


Mus musculus- — 


AAA37876 


Intercellular Adhesion Molecule-1 Precursor 


Mus musculus 


AAG30280 


Intracellular Adhesion Molecule 1 


Cricetulus griseus 


AAB39264 


Intercellular Adhesion Molecule-3 


Bos taurus 


AAF80287 


Intercellular Adhesion Molecule-1 Precursor 


Sus scrofa 


AAA18478 


Telecephalin 


Oryctolagus cuniculus 
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ACCESSION NO, 


ICAMNAME 


SPECIES 


NP 032345 


Intercellular Adhesion Molecule 5, telencephalin 


Mus musculus 


BAB41106 


Cell adhesion molecule TCAM-1 


Mus musculus 


NP 067705 


Testicular Cell Adhesion Molecule 1 


Rattus norvegicus 


AAG35584 


Nectin-Like Protein 1 


Mus musculus 


AAC18956 


CD22 Protein 


Homo sapiens 


AAA35415 


Intercellular Adhesion Molecule 1 


Pan troglodytes 


" AAA83206 


89 kDa Protein 


Mus musculus 


AAA92551 


Intercellular Adhesion Molecule- 1 


Canis familiaris 


AAB06749 


Intercellular Adhesion Molecule- 1 


Bos taurus 


AAD13617 


Intercellular Adhesion Molecule- 1 Precursor 


Ovis aries 


NP 037099 


Intercellular Adhesion Molecule- 1 


Rattus norvegicus 


AAE22202 


ICAM-4 


Rattus norvegicus 


AAA60392 


cell surface glycoprotein 


Homo sapiens 


AAF91086 


nephrin 


Rattus norvegicus 


AAF91087 


nephrin 


Mus musculus 
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Likewise, numerous heavy chain constant regions of different Ig molecules, both in 
humans and other species, are known that can be substituted in for those specific Ig regions of 
the chimeras described herein. 

C. Vectors, Cells and Plants Containing Immunoadhesins 

The present invention also contemplates expression and cloning vectors, cells and 
plants containing the immunoadhesins of the present invention. Technology for isolating the 
genes encoding the various portions of the immunoadhesions are well-known to one of skill in 
the art and can be applied to insert the various required genes into expression vectors;and 
cloning vectors such as those vectors can be introduced into cells and into transgenic plants. 

The present invention contemplates a method of assembling an immunoadhesin 
comprising the steps of: introducing into an organism a DNA segment encoding a chimeric 
ICAM-1 molecule, immunoglobulin J chain, and introducing into the same organism a DNA • 
encoding a secretory component. The preferred secretory component contains at least a 
segment of the amino acid residues 1 to residue about 606 of the human polyimmunoglobulin 
receptor (plgR) amino acid residue sequence or analogous amino acid residues from other 
species (Mostov, Ann Dev. Immu. 12:63-84 1994). 

The present invention contemplates eukaryotic cells, including plant cells, containing 
immunoadhesins of the present invention. The present invention also contemplates plant cells 
that contain nucleotide sequences encoding the various components of the immunoadhesin of 
the present invention. One skilled in the art will understand that the nucleotide sequences that 
encode the secretory component protection protein and the chimeric ICAM-1 molecule and J. 
chain will typically be operably linked to a promoter and present as part of an expression 
vector or cassette. Typically, if the eukaryotic cell used is a plant cell than the promoter used 
will be a promoter that is able to operate in a plant cell. After the chimeric ICAM-1 genes, 
secretory component genes and J chain genes are isolated, they are typically operatively 
linked to a transcriptional promoter in an expression vector. The present invention also 
contemplates expression vectors containing a nucleotide sequence encoding a chimeric 
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ICAM-1 molecule which has been operatively linked to a regulatory sequence for expression. 
These expression vectors place the nucleotide sequence to be expressed in a particular cell 3' 
of a promoter sequence which causes the nucleotide sequence to be transcribed and expressed. 
The expression vector may* also contaiji various enhancer sequences which improve the 
5 efficiency of this transcription. In addition, such sequences as terminators, polyadenylation 
(poly A) sites and other 3 ' end processing signals may be included to enhance the amount of 
nucleotide sequence transcribed within a particular cell. 

Expression of the components in the organism of choice can be derived from an 
independently replicating plasmid, or from a permanent component of the chromosome, or 

10 from any piece of DNA which may transiently give rise to transcripts encoding the 

components. Organisms suitable for transformation can be either prokaryotic or eukaryotic. 
Introduction of the components of the complex can be by direct DNA transformation, by 
biolistic delivery into the organism, or mediated by another organism as for example by the 
action of recombinant Agrobacterium on plant cells . Expression of proteins in transgenic 

1 5 organisms usually requires co-introduction of an appropriate promoter element and 

polyadenylation signal. In one embodiment of the invention, the promoter element potentially 
results in the constitutive expression of the components in all of the cells of a plant. 
Constitutive expression occurring in most or all of the cells will ensure that precursors can 
occupy the same cellular endomembrane system as might be required for assembly to occur. 

20 Expression vectors compatible with the host cells, preferably those compatible with 

plant cells are used to express the genes of the present invention. Typical expression vectors 
useful for expression of genes in plants are well known in the art and include vectors derived 
from the tumor-inducing (Ti) plasmid of Agrobacterium tumefaciens described by Rogers et 
al. 9 Meth. in EnzymoL, 153:253-277 (1987). However, several other expression vector 

25 systems are known to function in plants. See for example, Verma et al, PCT Publication No. 
WO87/00551; and Cocking and Davey, Science, 236:1259-1262 (1987). 
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The expression vectors described above contain expression control elements including 
the promoter. The genes to be expressed are operatively linked to the expression vector to 
allow the promoter sequence to direct RNA polymerase binding and synthesis of the desired 
polypeptide coding gene. Useful in expressing the genes are promoters which are inducible, 
5 viral, synthetic, constitutive, and regulated. The choice of which expression vector is used 
and ultimately to which promoter a nucleotide sequence encoding part of the immunoadhesin 
of the present invention is operatively linked depends directly, as is well known in the art, on 
the functional properties desired, e.g. the location and timing of protein expression, and the 
host cell to be transformed, these being limitations inherent in the art of constructing 
1 0 recombinant DNA molecules. However, an expression vector useful in practicing the present 
invention is at least capable of directing the replication, and preferably also the expression of 
the polypeptide coding gene included in the DNA segment to which it is operatively linked. 

In preferred embodiments, the expression vector used to express the genes includes a 
selection marker that is effective in a plant cell, preferably a drug resistance selection marker. 

15 A preferred drug resistance marker is the gene-whose expression results in kanamycin 

resistance, i.e., the chimeric gene containing the nopaline synthase promoter, Tn5 neomycin 
phosphotransferase II and nopaline synthase 3' nontranslated region described by Rogers et 
ah, in Methods For Plant Molecular Biology, a Weissbach and H. Weissbach, eds., Academic 
Press Inc., San Diego, Calif. (1988). A useful plant expression vector is commercially 

20 available from Pharmacia, Piscataway, NJ. Expression vectors and promoters" for expressing 
foreign proteins in plants have been described in U.S. Pat. Nos. 5,188,642; 5,349,124; 
5,352,605, and 5,034,322 which are hereby incorporated by reference. 

A variety of methods have been developed to operatively link DNA to vectors via 
complementary cohesive termini. For instance, complementary homopolymer tracks can be 
25 added to the DNA segment to be inserted into the vector DNA. The vector and DNA segment 
are then joined by hydrogen bonding between the complementary homopolymeric tails to 
form recombinant DNA molecules. Alternatively, synthetic linkers containing one or more 
restriction endonuclease sites can be used to join the DNA segment to the expression vector. 
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The synthetic linkers are attached to blunt-ended DNA segments by incubating the 
blunt-ended DNA segments with a large excess of synthetic linker molecules in the presence 
of an enzyme that is able to catalyze the ligation of blunt-ended DNA molecules, such as 
bacteriophase T4 DNA ligase. Thus, the products of the reaction are DNA segments carrying 
5 synthetic linker sequences at their ends. These DNA segments are then cleaved with the 
appropriate restriction endonuclease and ligated into an expression vector that has been 
cleaved with an enzyme that produces termini compatible with those of the synthetic linker. 
Synthetic linkers containing a variety of restriction endonuclease sites are commercially 
available from a number of sources including New England BioLabs, Beverly, Mass. 

10 The nucleotide sequences encoding the secretory component, J chain, the chimeric 

ICAM-1 molecules of the present invention are introduced into the same plant cell either 
directly or by introducing each of the components into a plant cell and regenerating a plant 
and cross-hybridizing the various components to produce the final plant cell containing all the 
required compcments. 

1 5 Any method may be used to introduce the nucleotide sequences encoding the 

components of the immunoadhesins of the present invention into a eukaryotic cell. For 
example, methods for introducing genes into plants include Agrobacterium-mediated plant 
transformation, protoplast transformation, gene transfer into pollen, injection into 
reproductive organs and injection into immature embryos. Each of these methods has distinct 

20 advantages and disadvantages. Thus, one particular method of introducing genes into a 
particular eukaryotic cell or plant species may not necessarily be the most effective for 
another eukaryotic cell or plant species. 

Agrobacterium tumefaciens-mediated transfer is a widely applicable system for 
introducing genes into plant cells because the DNA can be introduced into whole plant 
25 tissues, bypassing the need for regeneration of an intact plant from a protoplast. The use of 
Agrobacterium-mediaXed expression vectors to introduce DNA into plant cells is well known 
in the art. See, for example, the methods described by Fraley et al. 9 Biotechnology, 3:629 
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(1985) and Rogers et al, Methods in Enzymology, 153:253-277 (1987). Further, the 
integration of the Ti-DNA is a relatively precise process resulting in few rearrangements. The 
region of DNA to be transferred is defined by the border sequences and intervening DNA is 
usually inserted into the plant genome as described by Spielmann et al, Mol. Gen. Genet., 

5 205:34 (1986) and Jorgensen et al, Mol. Gen. Genet., 207:471 (1987). Modem 

Agrobacterium transformation vectors are capable of replication in Escherichia coli as well as 
Agrobacterium, allowing for convenient manipulations as described by Klee et al, in Plant 
DNA Infectious Agents, T. Hohn and J. Schell, eds., Springer-Verlag, New York,;pp, 179-203 
(1985). Further recent technological advances in vectors for Agrobacterium-mediated gene 

10 transfer have improved the arrangement of genes and restriction sites in the vectors to 

facilitate construction of vectors capable of expressing various polypeptide coding genes.. The 
vectors described by Rogers et al, Methods in Enzymology, 153:253 (1987), have convenient 
multi-linker regions flanked by a promoter and a polyadenylation site for direct expression of 
inserted polypeptide coding genes and are suitable for present purposes. 

15 In those plant species where Agrobacterium-va.edxaX.ed transformations efficient, it is 

the method of choice because of the facile and defined nature of the gene transfer. 
Agrobacterium-mediated transformation of leaf disks and other tissues appears to be limited 
to plant species that Agrobacterium tameiadens naturally infects. Thus, 
Agrobacterium-mediated transformation is most efficient in dicotyledonous plants. 

20 Few monocots appear to be natural hosts for Agrobacterium, although transgenic 

plants have been produced in asparagus using Agrobacterium vectors as described by 
Bytebier et al, Proc. Natl. Acad. Sci. USA., 84:5345 (1987). Therefore, commercially 
important cereal grains such as rice, corn, and wheat must be transformed using alternative 
methods. Transformation of plant protoplasts can be achieved using methods based on 

25 calcium phosphate precipitation, polyethylene glycol treatment, electroporation, and 
combinations of these treatments. See, for example, Potrykus et al, Mol. Gen. Genet, 
199:183 (1985); Lorz etal.Mol. Gen. Genet., 199:178 (1985); Fromm etal, Nature, 319:791 
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(1986); Uchimiya et aL, Mol. Gen. Genet, 204:204 (1986); Callis et aL, Genes and 
Development, 1:1 183 (1987); andMarcotte et aL, Nature, 335:454 (1988). 

Application of these methods to different plant species depends upon the ability to 
regenerate that particular plant species from protoplasts. Illustrative methods for the 
5 regeneration of cereals from protoplasts are described in Fujimura et ah, Plant Tissue Culture 
Letters, 2:74 (1985); Toriyama et aL, TheorAppL Genet., 73:16 (1986); Yamada et aL, Plant 
Cell Rep., 4:85 (1986); Abdullah et aL, Biotechnology, 4:1087 (1986). 

To transform plant species that cannot be successfully regenerated from protoplasts, 
other ways to introduce DNA into intact cells or tissues can be utilized. For example, 

1 0 regeneration of cereals from immature embryos or explants can be effected as described by 
Vasil, Biotechnology, 6:397 (1988). In addition, "particle gun" or high-velocity 
microprojectile technology can be utilized. Using such technology, DNA is carried through 
the cell wall and into the cytoplasm on the surface of small (0.525 nm) metal particles that 
have been accelerated to speeds of one to several hundred meters per second as described in 

15 Klein et aL, Nature, 327:70 (1987); Klein et aL, Proc. NatL Acad Set U.S.A., _85:8502 
(1988); and McCabe et aL, Biotechnology, 6:923 (1988). The metal particles penetrate 
through several layers of cells and thus allow the transformation of cells within tissue 
explants. Metal particles have been used to successfully transform corn cells and to produce 
fertile, stably transformed tobacco and soybean plants. Transformation of tissue explants 

20 eliminates the need for passage through a protoplast stage and thus speeds the production of 
transgenic plants. 

DNA can also be introduced into plants by direct DNA transfer into pollen as 
described by Zhou et aL, Methods in Enzymology, 101 :433 (1983); D. Hess, Intern Rev. 
CytoL, 107:367 (1987); Luo et aL, Plant Mol. Biol. Reporter, 6:165 (1988). Expression of 
25 polypeptide coding genes can be obtained by injection of the DNA into reproductive organs 
of a plant as described by Pena et aL, Nature, 325:274 (1987). DNA can also be injected 
directly into the cells of immature embryos and the rehydration of desiccated embryos as 
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described by Neuhaus et aL, Theor. Appl Genet, 75:30 (1987); and Benbrook et aL, in 
Proceedings Bio Expo 1986, Butterworth, Stoneham, Mass., pp. 27-54 (1986). 

The regeneration of plants from either single plant protoplasts or various explants is 
well known in the art. See, for example, Methods for Plant Molecular Biology, A. Weissbach 
5 and H. Weissbach, eds., Academic Press, Inc., San Diego, Calif. (1988). This regeneration 
and growth process includes the steps of selection of transformant cells and shoots, rooting 
the transformant shoots and growth of the plantlets in soil. 

The regeneration of plants containing the foreign gene introduced by Agrobacterium 
tumefaciens from leaf explants can be achieved as described by Horsch et aL, Science, 

10 227:1229-1231 (1985). In this procedure, transformants are grown in the presence of a 
selection agent and in a medium that induces the regeneration of shoots in the plant species 
being transformed as described by Fraley et aL, Proc. NatL Acad. ScL U.S.A., 80:4803 (1983). 
This procedure typically produces shoots within two to four weeks and these transformant 
shoots are then transferred to an appropriate root-inducing medium containing the selective 

15 agent and an antibiotic to prevent bacterial growth. Transformant shoots that rooted in the 
presence of the selective agent to form plantlets are then transplanted to soil to allow the 
production of roots, these procedures will vary depending upon the particular plant species 
employed, such variations being well known in the art. ... 

The immunoadhesins of the present invention may be produced in any plant cell 
20 including plant cells derived from plants that are dicotyledonous or monocotyledonous, 
solanaceous, alfalfa, legumes, or tobacco. 

Transgenic plants of the present invention can be produced from any sexually 
crossable plant species that can be transformed using any method known to those skilled in 
the art. Useful plant species are dicotyledons including tobacco, tomato, the legumes, alfalfa, 
25 oaks, and maples; monocotyledons including grasses, corn, grains, oats, wheat, and barley; 
and lower plants including gymnosperms, conifers, horsetails, club mosses, liverworts, 
hornworts, mosses, algaes, gametophytes, sporophytes or pteridophytes. 
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The present invention also contemplates expressing the immunoadhesins within 
eukaryotic cells including mammalian cells. One of skill in the art will understand the various 
systems available for expression of the immunoadhesin in mammalian cells and can readily 
modify those system to express the immunoadhesions and chimeric ICAM-1 molecules of the 

5 present invention in those cells. In preferred embodiments, the chimeric ICAM-1, J chain and 
secretory component molecules of the present invention are placed in a vector pCDM8 which 
has been previously described by Aruffo, et al, Proa Natl. Acad. ScL U.S.A, 84:8573-8577 
(1987). The use of the PCDM8 construct is by no means unique and numerous other systems 
are available that do not utilize the cog cell expression system. For example, the following 

10 United States Patents describe useful eukaryotic expression systems that may be used with the 
chimeric ICAM-1 and other molecules of the immunoadhesin. 

D. Compositions Containing Immunoadhesins 

The present invention also contemplates compositions containing an immunoadhesin 
of the present invention together with plant macromolecules or material. Typically these 

15 plant macromolecules or plant materials are derived from any plant useful in the present 
invention. The plant macromolecules are present together with an immunoadhesin of the 
present invention for example, in a plant cell, in an extract of a plant cell, or in" a plant. 
Typical plant macromolecules associated with the immunoadhesin of the present invention in 
a composition are ribulose bisphosphate carboxylase, light harvesting complex pigments 

20 (LHCP), secondary metabolites or chlorophyll. The compositions of the present invention 
have plant material or plant macromolecules in a concentration of between 0.01% and 99% 
mass excluding water. Other compositions include compositions having the immunoadhesins 
of the present invention present at a concentration of between 1% and 99% mass excluding 
water. Other compositions include immunoadhesins at a concentration of 50% to 90% mass 

25 excluding water. 

The compositions of the present invention may contain plant macromolecules at a 

concentration of between 0.1% and 5% mass excluding water. Typically the mass present in 

the composition will consist of plant macromolecules and immunoadhesins of the present 
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invention. When the immunoadhesins of the present invention are present at a higher or 
lower concentration the concentration of plant macromolecules present in the composition 
will vary inversely. In other embodiments the composition of plant macromolecules are 
present in a concentration of between 0. 12% and 1 % mass excluding water. 

The present invention contemplates a composition of matter comprising all or part of 
the following: a chimeric ICAM-l molecule, a J chain or a secretory component. These 
components form a complex and are associated as was previously described. Typically, the 
composition also contains molecules derived from a plant. This composition may also be 
obtained after an extraction process yielding functional immunoadhesin and plant-derived 
molecules. 

The extraction method comprises the steps of applying a force to a plant containing 
the complex whereby the apoplastic compartment of the plant is ruptured releasing said 
complex. The force involves shearing as the primary method of releasing the apoplastic 
liquid. " 

The whole plant or plant extract contains an admixture of immunoadhesin and various 
other macromolecules of the plant. Among the macromolecules contained in the admixture is 
ribulose bisphosphate carboxylase (RuBisCo) or fragments of RuBisCo. Another 
macromolecule is LHCP. Another molecule is chlorophyll. 

Other useful methods for preparing compositions containing immunoadhesins having 
chimeric ICAM-1 molecule include extraction with various solvents and application of 
vacuum to the plant material. The compositions of the present invention may contain plant 
macromolecules in a concentration of between about 0.1% and 5% mass excluding water. 

The present invention also contemplates therapeutic compositions which may be used 
in the treatment of a patient or animal. Administration of the therapeutic composition can be 
before or after extraction from the plant or other transgenic organism. Once extracted the 
immunoadhesins may also be further purified by conventional techniques such as size 
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exclusion, ion exchange, or affinity chromatography. Plant molecules may be 
co-administered with the complex. 

The present invention also contemplates that the relative proportion of plant-derived 
molecules and animal-derived molecules can vary. Quantities of specific plant proteins, such 
5 as RuBisCo or chlorophyll may be as little as 0.0 1 % of the mass or as much as 99.9% of the 
mass of the extract, excluding water. 

The present invention also contemplates the direct use of the therapeutic plant extract 
containing immunoadhesins without any further purification of the specific therapeutic 
component. Administration may be by topical application, oral ingestion, nasal spray or any 
1 0 other method appropriate for delivering the antibody to the mucosal target pathogen. 

E. Pharmaceutical Compositions, Formulations, And Routes Of Administration 

The immunoadhesins described herein can be administered to a patient, preferably in 
the form of a suitable pharmaceutical composition. Such composition may include 
components in addition to, or in lieu of, those described above. The composition preferably 

15 exhibits either or both of a therapeutic and prophylactic property when administered. The 
preparation of such compositions can be done according to routine methodologies in the art, 
and may assume any of a variety of forms, e.g., liquid solutions, suspensions or 
emulsifications, and solid forms suitable for inclusion in a liquid prior to ingestion. 
Techniques for the formulation and administration of polypeptides and proteins may be found 

20 in Remington 's Pharmaceutical Sciences, Mack Publishing Co., Easton, PA, latest edition. 
Using these procedures, one of ordinary skill can utilize the immunoadhesins of the invention 
to achieve success without undue experimentation. 

1 . Administration Routes 

Suitable routes of administration for the invention include, e.g., oral, nasal, inhalation, 
25 intraocular, phanyngeal, bronchial, transmucosal, or intestinal administration. Alternatively, 
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one may administer the compound in a local manner, e.g. 9 via injection or other application of 
the compound to a preferred site of action. 

2. Formulations 

The pharmaceutical compositions of the present invention may be manufactured in a 
5 manner that is itself known, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping or lyophilizing processes. 
One or more physiologically acceptable carriers comprising excipients and/or other auxiliaries 
can be used to facilitate processing of the active compounds into pharmaceutical preparations. 
Proper formulation is dependent upon the particular route of administration chosen. 

10 For injection, the agents of the invention may be formulated in aqueous solutions, - 

preferably in physiologically compatible buffers such as Hanks' s solution, Ringer's solution, 
or physiological saline buffer. For transmucosal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. Such penetrants are generally known in 
the art. * - 

15 For oral 'administration, the compounds can be formulated readily by combining the .* 

active compounds with pharmaceutical^ acceptable carriers well known in the art. Such 
carriers enable the compounds of the invention to be formulated as tablets, pills, dragees, 
capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient 
to be treated. Suitable carriers include excipients such as, e.g., fillers such as sugars, 

20 including lactose, sucrose, mannitol, and/or sorbitol; cellulose preparations such as, e.g., 
maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl 
cellulose, hydroxypropylmethyl- cellulose, sodium carboxymethylcellulose, and/or 
polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, such as the 
cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such as sodium 

25 alginate. 
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Dragee cores are provided with suitable coatings. For this purpose, concentrated sugar 
solutions may be used, which may optionally contain gum arabic, talc, polyvinyl pyrrolidone, 
carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable 
organic solvents or solvent mixtures. Dyestuffs or pigments may be added to the tablets or 
5 dragee coatings for identification or to characterize different combinations of active 
compound doses. ■ 

Pharmaceutical preparations which can be used orally include push-fit capsules made 
of gelatinras well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol 
or sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler 
10 such as lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate 
and, optionally, stabilizers. In soft capsules, the active compounds may be dissolved or 
suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. 
In addition, stabilizers may be added. All formulations for oral administration should be in 
dosages suitable for such administration. 

15 For buccal administration, the compositions may take the form of tablets or lozenges 

formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
invention may be conveniently delivered in the form of an aerosol spray presentation, from . 
pressurized packs or a nebuliser, with the use of a suitable propellant, e,g., 
20 dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide 
or other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined 
by providing a valve to deliver a metered amount. Capsules and cartridges of e.g. gelatin for 
use in an inhaler or insufflator may be formulated containing a powder mix of the compound 
and a suitable powder base such as lactose or starch. 

25 Alternatively, the active ingredient may be in powder form for constitution with a 

suitable vehicle, e.g. 9 sterile pyrogen-free water, before use. 
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In addition, the compounds may also be formulated as a depot preparation. Such long 
acting formulations may be administered by implantation (for example subcutaneously or 
intramuscularly) or by intramuscular injection. Thus, for example, the compounds may be 
formulated with suitable polymeric or hydrophobic materials (for example as an emulsion in 
5 an acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for example, as a 
sparingly soluble salt 

Additionally, the compounds may be delivered using a sustained-release system, such 
as semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. 
Various sustained-release materials have been established and are well known by those skilled 
10 in the art. Sustained-release capsules may, depending on their chemical nature, release the 
compounds for a few weeks up to over 100 days. Depending on the chemical nature and the 
biological stability of the therapeutic reagent, additional strategies for protein stabilization 
may be employed. 

The pharmaceutical compositions also may comprise suitable solid or gel phase 
1 5 carriers or excipients. Examples of such carriers or excipients include but are not limited to 
calcium carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, 
and polymers such as polyethylene glycols. 

Pharmaceutical^ compatible salts may be formed with many acids, including but not 
limited to hydrochloric, sulfuric, acetic, lactic, tartaric, malic, succinic, citric, etc. Salts tend 
20 to be more soluble in aqueous or other protonic solvents that are the corresponding free base 
forms. In solutions, manipulation of pH is also routinely employed for optimizing desired 
properties. 

3. Determining Effective Dosages and Dosage Regimens 

Pharmaceutical compositions suitable for use in the present invention include 
25 compositions where the active ingredients are contained in an amount effective to achieve an 
intended purpose, e.g., a therapeutic and/or prophylactic use. A pharmaceutically effective 
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amount means an amount of compound effective to prevent, alleviate or ameliorate symptoms 
of disease or prolong the survival of the subject being treated. Determination of a 
pharmaceutical^ effective amount is well within the capability of those skilled in the art, and 
will typically assume an amount of between about 0.5 jig/kg/day and about 500g/kg/day, with 
individual dosages typically comprising between about 1 nanogram and several grams of 
immunoadhesin. 

For any compound used in the methods of the invention, the therapeutically effective > 
dose can be estimated initially from cell culture assays. For example, varying dosages can be 
administered to different animals or cell cultures and compared for effect. In this way, one 
can identify a desired concentration range, and prepare and administer such amount 
accordingly. Optimization is routine for one of ordinary skill in the art. 

The person of skill, in addition to considering pharmaceutical efficacy, also considers 
toxicity according to standard pharmaceutical procedures in cell cultures or experimental 
animals, e.g., for determining the LD50 (the dose lethal to 50% of the population) and the 
ED50 (the dose therapeutically effective in 50% of the population). The dose ratio between 
toxic and therapeutic effects is the therapeutic index and it can be expressed as the ratio 
between LD50 and ED50. Compounds which exhibit high therapeutic indices are^referredr 
The data obtained from these cell culture assays and animal studies can be used in formulating 
a range of dosage for use in humans. The dosage of such compounds lies preferably wthin a 
range of concentrations that include the ED50 with little or no toxicity. The dosage may vary 
within this range depending upon the dosage form employed and the route of administration 
utilized. The exact formulation, route of administration and dosage can be chosen by the 
individual physician in view of the patient's condition. (See e.g., Fingl et al y 1975, in "The 
Pharmacological Basis of Therapeutics," Ch. 1 p. 1). 

Dosage amount and frequency may be adjusted to provide mucosal levels of 
immunadhesin sufficient to maintain or provide a pharmaceutical effect, e.g., therapeutic 
and/or prophylactic. The minimal effective concentration (MEC) will Vary for each 
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immunadhesin and immunoadhesin fonnulation, but can be estimated from in vitro and/or in 
vivo data. Dosages necessary to achieve MEC will depend on individual characteristics and 
route of administration. However, assays as described herein can be used to determine 
mucosal concentrations, which can then be further optimized in amount and precise 
formulation. 

Dosage intervals can also be determined using MEC value. Compounds can be 
administered using a regimen which maintains mucosal levels above the MEC for. 1 0-90% of 
the time^ 30-90% of the time, or, most preferably, 50-90% of the time. 

The compositions may, if desired, be presented in a pack or dispenser device which 
may contain one Or more unit dosage forms containing the active ingredient. The pack may 
for example comprise metal or plastic foil, such as a blister pack. The pack Or dispenser 
device may be accompanied by instructions for aciministration. The pack or dispenser may 
also be accompanied with a notice associated with the container in form prescribed by a 
governmental agency regulating the manufacture, use, or sale of pharmaceuticals, which '■ 
notice is reflective of approval by the agency of the form of the immunoadhesin for human or 
Veterinary administration. Such notice, for example, may be the labeling approved by the 
U.S. Food and Drug Administration for prescription drugs, or the approved product insert. 
Compositions comprising a compound of the invention formulated in a compatible 
pharmaceutical carrier may also be prepared, placed in an appropriate container, and labeled 
for treatment of an indicated condition^ e.g. treatment or prophylaxis of a disease mediated by 
host organism/patient ICAM molecules. 

F. Methods of Treatment and Prevention of ICAM-mediated Afflictions 

A patient in need of therapeutic and/or prophylactic immunoadhesin chimeras of the 
invention, e.g., to counter rhinovirus infection and/or symptoms such as occur with colds, can 
be adrninistered a pharmaceuticaUy effective amount of desired immunoadhesin, preferably as 
part of a pharmaceutical composition determined, produced, and administered as described 
above. These formulations and delivery modalities can vary widely. Described following are 
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preliminary procedures that can be used to deduce effective amounts and toxicity, and which 
can then be conveniently used to determine treatment and prophylaxis parameters and 
regimens, both in humans and other animals. These procedures are illustrative only and are 
not intended to be limiting of the invention. Further, these procedures are routine for one of 
5 ordinary skill in the art. 

1 . Ability of the Immunoadhesin to Reduce Rhinovirus Inf ecthity in 
Humans: Dose Escalation Tolerance Study 

hnmunoadhesins of the invention may be tested, e.g., using randomized controlled 
trials to determine the effect of administration, e.g., intranasal, of immunoadhesin on 

1 0 infection. Other administration routes can be used. Various assays exist that can be used to 
monitor effect, e.g., IL-8 response assays assays that evaluate illness symptoms, e.g., cold 
symptoms caused by rhinovirus infection. These studies can evaluate the extent to which an 
immunoadhesin taken by a patient subjects can prevent or treat riiinovirus infection. For 
example, healthy or unhealthy subjects can be administered the immunoadhesin and 

1 5 evalutated over a time course, e.g., in tandem with rhinovirus inoculation and/or illness 
progression. The clinical protocols used may be based on protocols previously used in 
evaluation of a recombinant soluble ICAM-1 molecule for efficacy against rhinovirus 
infection, or modifications thereto (Turner, el al.,JAMA 281:1797-804, 1999). 

Male and female subjects of any species, age, health, or disease state can be evaluated 
20 The subjects may exhibit a serum neutralizing antibody titer in advance of study, which titer 
may fluctuate in response to infection and immunoadhesin administration. 

The immunoadhesin of the present invention may be formulated as a buffered saline 
with varying amounts of immunoadhesin within and administered at various intervals to a 
patient. Single ascending dose and multiple ascending dose studies can be used to evaluate 
25 the safety of the immunoadhesin. In each case, one or more subjects may be evaluated at each 
dosage level, some receiving the immunoadhesin, and one or more optionally receiving 
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placebo. In either study, multiple dosage levels may be evaluated. Dosage levels can vaiy, 
but are typically in the nanogram to gram range. 

Dosages may be administered over seconds, minutes, hours, weeks, and months, and 
evaluated for toxicity and/or pharmaceutical effect. 

5 Safety and toxicity may be assessed, e.g., by visual examination of the nasal mucosa 

for signs of iiritation or inflammation. Blood safety evaluations can also be employed 
according to routine methods and using sensitive assays such as ELISA to determine various 
blood components, including circulating immunoadhesin and rhinovirus quantities. Naval 
lavage testing may similarly be done according to routine methodologies. 

10 Routine statistical analyses and calculations may be employed to determine efficacy 

and toxicity predicted over time courses for single patients and/or for populations of patient- 
recipients.. 

Challenge studies as well known in the, art can be used to demonstrate that treatment 
protects against clinical colds and/or reduces cold symptoms after viral challenge, and using 
15 commercially available starting materials such virus, cells, and animals. See, e.g. 9 Gwaltney, 
el al, Prog. Med. Virol 39:256-263, 1992. 

The following examples illustrate the disclosed invention. These examples in no way 
limit the scope of the claimed invention. 
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EXAMPLES 

!• Constructi on of Immunoadhesin Expression Cassettes 

A cassette encoding ICAM-1 extracellular domains Dl through D5 was prepared by 
PCR cloning. Specifically, a fragment containing all five extracellular Ig-like domains of 
ICAM-1 was amplified from plasmid P CDICl-5D/TgA (insert Martin, et al. reference) using 
the following oligonucleotide primers: - 

5 '-TCTG^CCCAGGAACTA^TTTGGCACAGACATCTGTGTCCCCCTCAAAAGTC-S ' 
(SEQIDNO:6) 

5 '-C ATACCGGGGACTAGTCAC ATTC ACGGTC ACCTCGCGG-3 ' 
(SEQIDNO:7) 

These two primers were designed to introduce Spel sites at the 5' and 3 ' ends of the 
PCR fragment (underlined nucleotides). PCR was performed with Pfu polymerase 
(Stratagene) to reduce accumulation of errors.- The PCR fragment was cloned into the vector - 
PCRScript (Stratagene), and sequenced before fusing to the human IgA2 cassettes (with and 
without SEKDEL at the caAoxy-terminus). 

Constructs for the expression in plants of human J chain and secretory component, as., 
well as a human IgA2 heavy chain, were developed. A heavy chain expression cassette vector 
was made and called P SSpHuA2 (See FIG. 1). It contains sequence encoding a bean legumin 
signal peptide (Baumlem ef al., Nucleic Acids Res. 14 (6), 2707-2720, 1986). The sequence 
of bean legurnin is provided as GenBank Accession No. X03677, and the sequence of the 
bean legiirnin signal peptide is SEQ ID NO: 10 (also see Fig. 8) and the IgA2m(2) constant 
region with Spel and Sad sites in between, and the SuperMas promoter for driving the 
expression of a signal peptide and the constant regions of the human IgA2m(2) heavy-chain. 

The amplified DNAs encoding the first five domains of human ICAM-1, and the Fc 
region of human IgA2m(2) were fused in a plant-expression cassette to make a chimeric 
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ICAM-1 molecule expression construct, shown in FIG. 2A. This was done by cloning the 
fragment encoding the five extracellular domains of ICAM-1 into vector pSSPHuA2 to 
generate P SSPICAMHuA2. The convenient restriction sites (5' Spel and 3' Spe I) allowed 
the amplified fragment to be inserted between the signal peptide and the Gal domain. In the 
resulting construct, expression of the chimeric ICAM-1 molecule is under the control of the 
constitutive promoter "superMAS" (Ni etal, 1995) and the nos 3' terminator region. 

The resulting chimeric ICAM-1 molecule construct contains no variable region. Upon 
translation of the mRNA, signal peptide cleavage is predicted to deposit the ICAM-l-heavy 
chain fusion into the plant cell's endoplasmic reticulum (ER). DNA encoding an ER retention 
signal (RSEKDEL, SEQ ID NO: 5) was appended to the 3 ' end of the heavy-chain in order to 
boost the expression level of the construct The amino acid sequence SEKDEL (SEQ ID NO: 
4) is the consensus signal sequence for retention of proteins m the endoplasrdc reticulum m 
plant cells. This sequence has been shown to enhance accumulation levels of antibodies in 
plants (Schoutentf al, Plant Molecular Biology 30:781-793,1996). The amino acid sequence 
of the chimeric ICAM-1 molecule construct is* shown in FIG. 2B. The DNA sequence and 
translational frame of the construct was verified before it was used for particle bombardment. 

It has been shown recently that assembly of J chain with IgA takes place in the Golgi 
apparatus (^oo etal, J. Biol Chem. 274:33771-33777, 1999), and so constructions of heavy. . 
chain without SEKDEL have been made as well. The ICAM-1 fragment was cloned into an „ 
expression cassette containing the IgA2m(2) constant region without SEKDEL. 

2 - Expression of Assembled Immnnnarihqsin in Plants 
A. Immunoadliesin Expression Vectors 

The plasmid P SSPICAMHuA2 (SEQ ID NO:9 and FIG. 8) is 6313 bp in length. 

Nucleotides 49-1 165 represent the Superpromoter (Ni et al. Plant Journal 7:661-676, 1995). 

Nucleotides 1 166-3662 comprise a sequence encoding a human ICAM-l/human IgA2m(2) 

constant hybrid with linker sequences. A consensus Kozak sequence (Kozak, Ce//44(2):283- 

92, 1986) is included (nt 1186-1 192) to enhance translation initiation, as well as the signal 
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peptide from V. faba legumin (nt 1189-1257; Baumlein et al, Nucleic Acids Reg. 14(6):2707- 
2720 (1986). The sequence of the human IgA2m(2) constant region (nt 3663-3633) has been 
previously published (Chintalacharuvu, et al, J. /mm. 152: 5299-5304, 1994). A sequence 
encoding the endoplasmic reticulum retention signal SEKDEL is appended to the end of the 
5 heavy Chain (nt 3634-3654). Nucleotides 3663-3933 derive from the nopaline synthase 3 s 
end (transcription termination and polyadenlyation signal; Depicker et al., 1982). The 
remainder of the plasmid derives from the vector pSP72 (Promega). 

The plasmid pSHuJ (SEQ ID NO: 1 1 and FIG. 8) is 4283 bp in length. Nucleotides 
14-1136 represent the Superpromoter (Ni et al, Plant Journal 7:661-676, 1995) and 

10 nucleotides 1 137-1648 are shown in FIG. 8 and comprise a sequence encoding the human J 
Chain including the native signal peptide (Max and Korsmeyer, Jlmm. 152:5299-5304, 1985) 
along with linker sequences. A consensus Kozak sequence (Kozak, Cell 44(2):283-92, 1986) 
is included (nt 1 162-1 168) to enhance translation initiation. Nucleotides 16494902 derive 
from the nopaline synthase 3' end (transcription termination and polyadenlyation signal; 

15 Depicker et al, JMolAppl Genet l(6):561-73, 1982). The remainder of the plasmid derives - 
from the vector pSP72 (Promega). 

The plasmid pSHuSC (SEQ ID NO: 12 and FIG. 8) is 5650 bp in length. Nucleotides 
13-1136 are derived from the Superpromoter (Ni et al, Plant Journal 7:661-676, 1 995), and 
nucleotides 1 1 37-298 1 comprise a sequence encoding the human Secretory Component 

20 including the native signal peptide (Krajci, etal, Biochem. and Biophys. Res. Comm 158:783, 
1994) along with linker sequences. A consensus Kozak sequence (Kozak, Cell 44(2):283-92, 
1986) is included (nt 1 151-1 157) to enhance translation initiation. Nucleotides 2982-3236 
derive from the nopaline synthase 3 * end, providing a transcription termination and 
polyadenlyation signal, described in Depicker et al, JMolAppl Genet l(6):561-73 (1982). 

25 The remainder of the plasmid derives from the vector pSP72 (Promega). 

The plasmid pBMSP-1 (SEQ ID NO: 13 and FIG. 8) is derived from pGPTV-KAN. 
Becker et al, in Plant Molecular Biology 20, 1 195-1 197, (1992), describe new plant binary 
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vectors with selectable markers located proximal to the left T-DNA border, and the sequences 
outside of the left and right borders. Nucleotides 18-187 of pBMSP-1 represent the right T- 
DNA border, and nucleotides 1811-775 represent the superMAS promoter. Nucleotides 
2393-2663 represent the NOS promoter (Depicker et ah, J MolAppl Genet l(6):561-73, 
1982), nucleotides 2698-3492 encode the NPTII gene (conferring resistance to kanamycin), 
and nucleotides 351 1-3733 are the polyadenylation signal from A. tumefaciens gene 7 (Gielen 
et al, Embo J 3:835-46, 1984). Nucleotides 1768-976 encode the NPTII gene, and 
nucleotides 43 1 7-4464 represent the left T-DNA border. 

The plasmid pBMSP-lspJSC (SEQ ID NO:14 and FIG. 8) is a derivative of pBMSP, 
containing both J and SC under control of superpromoter. In this plasmid, nucleotides 1-149 
represent the left T-DNA border. Nucleotides 955-733 are the polyadenylation signal from A. 
tumefaciens gene, nucleotides 1768-976 encode the NPTII gene (conferring resistance to 
kanamycin), and nucleotides 2073-1803 represent the NOS promoter. Nucleotides 2635-3768 
represent the superMAS promoter, nucleotides 3774-5595 encode the Human Secretory 
component, and nucleotides 5603-5857 represent the NOS polyadenylation. signal. 
Nucleotides 5880-6991 represent the superMAS promoter, nucleotides 7007-7490 encode the 
Human Joining Chain, and nucleotides 7504-7757 represent the NOS polyadenylation signal. 
Nucleotides 7886-8057 represent the right T-DNA border. 

The plasmid pGPTV-HPT, encoding the enzyme conferring hygromycin resistance, is 
available commercially from the Max-Planck-Institut fur Ziichtungsforschung (Germany). It 
is described by Becker in Plant Molecular Biology 20, 1 195-1 197 (1992). 

B. Plant Transformation and Immunoadhesin Expression in Plants 

The expression cassettes described above were used to produce the assembled 
immunoadhesin in plants. Plasmids pSSPICAMHuA2, pSHuJ, pSHuSC and pBMSP-1 were 
co-bombarded into tobacco leaf tissue (AT. tabacum cultivar Xanthi) and transformed 
microcalli were selected on nutrient agar in the presence of kanamycin. Individual microcalli, 
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indicative of independent transformation events, were dissected from the parent tissue and 
propagated on nutrient agar with kanamycin. 

The callus tissues were screened for transgene expression. Callus #7132 was shown to 
express a chimeric ICAM-1 immunoadhesin and J chain by immunoblotting and PCR (data 
not shown). This callus did not possess DNA encoding the SC. The callus grew well in 
culture and, upon accumulation of sufficient mass, #7132 was bombarded again, this time 
with two of the plasmids described above, PBMSP-1 SpJSC, containing expression cassettes 
for both the J chain and SC and pGPTV-HPT, containing an expression cassette for the hpt I 
gene which confers hygromycin resistance. After a period of selection and growth on nutrient 
agar, several independent transfonnants were identified, by immunoblotting, that expressed 
the chimeric ICAM-1 molecule, the J chain and SC in several states of assembly. 

FIG. 3 illustrates the expression of the chimeric ICAM-1 molecule in independently 
transformed tobacco calli. FIG. 3A shows immunoblots of non-reducing SDS- 
polyacrylamide gels on which samples containing different transformed tobacco calli (C) and 
aqueous extracts (Aq) were run and probed for the presence of human JCAM. The solubility 
of the immunoadhesin assured us that extraction could be easily performed, and the similarity 
of signals leads us to believe in the reproducibility of expression. FIG. 3B shows 
immunoblots of nonreducing SDS-polyacrylamide gels containing various fractions of. 
partially purified immunoadhesin from callus Rhil07-ll. The blots were probed with 
antibodies against human ICAM (-ICAM), human IgA heavy chain (~a), human secretory 
component (~SC) and human J chain (~J). Secondary, enzyme-conjugated antibodies were 
employed as necessary to label immuno-positive bands with alkaline phosphatase. The 
specificity of immuno-blotting was further verified by a failure to detect immuno-positive 
bands in extracts of non-expressing calli (not shown). The expected MW for a dimerized 
chimeric ICAM-1 molecule, without glycosylation, is 173,318; this form is likely present in 
the band migrating just below the 250kD marker since it is immuno-positive for ICAM-1 and 
heavy-chain. This band is also immuno-positive for SC (total expected MW of -248 kD) but 
not for J chain which is somewhat unexpected given the canonical pathway for SIgA 
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assembly, which involves 2 cell types (in mammalian) and requires the presence of J chain 
prior to assembly of SC. A tetrameric immunoadhesin, containing a single molecule of J 
chain and a single molecule of SC, has an expected MW of -440 kD, prior to glycosylation. 
Several species with molecular weights well in excess of 200 kD, immuno-positive with all 
four probes, are readily apparent. 

Bombardment with DNA-coated microprojectiles is used to produce stable 
transformants in both plants and animals (reviewed by Sanford et al, Meth. Enz. 217:483- 
509,1993). Particle-mediated transformation with the vectors encoding the immunoadhesin of 
the present invention was performed using the PDS-1000/He particle acceleration device, 
manufactured by Bio-Rad. The PDS-1000/He particle acceleration device system uses 
Helium pressure to accelerate DNA-coated microparticles toward target cells. The physical 
nature of the technique makes it extremely versatile and easy to use. We have successfully 
transformed tobacco with all four components of a secretory IgA simultaneously. 

The basic biolistic procedure was performed as follows: A stock suspension of 

microprojectiles was prepared by mixing 60 mg of particles in 1 ml of absolute ethanol. This 

suspension was vortexed and 25-50 ul was removed and added to a sterile microcentrifuge 

tube. After micfocentrifuging for 30 seconds the ethanol was removed and the pellet - — - ---- ~ 

resuspended in 1 ml sterile water and centrifuged for 5 minutes. The water was then removed 

and the pellet resuspended in 25-50 ul of DNA solution containing a mixture of plasmid 

PNAs, usually, but not always in equimolar amounts. The amount of plasmid added varied 

between 0.5 ng and 1 jig per preparation. The following were added sequentially: 220 ul of 

sterile water, 250 ul of 2.5M CaC12, and 50 ul of 0.1M spermidine. This mixture was 

vortexed for at least 10 min and then centrifuged for 5 min. The supernatant was removed 

and the DNA/microprojectile precipitated in 600 ul of absolute ethanol, mixed and 

centrifuged lmin. The ethanol was removed and the pellet resuspended in 36 ul of ethanol. 

Ten ul of the suspension was applied as evenly as possible onto the center of a macrocarrier 

sheet made of Kapton (DuPont) and the ethanol was evaporated. The macrocarrier sheet and 

a rupture disk were placed in the unit. A petri dish containing surface-sterilized tobacco 
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leaves was placed below the stopping screen. The chamber was evacuated to 28-29mm Hg 
and the target was bombarded once. The protocol has been optimized for tobacco, but is 
optimized for other plants as well by varying parameters such as He pressure, quantity of 
coated particles, distance between the macrocarrier and the stopping screen and flying 
distance from the stopping screen to the tissue. 

Expression cassettes for chimeric ICAM-1 molecules were also assembled in binary 
vectors for use in Agrobacterium-mediBted transformation. An Agrobacterium binary vector 
designed for expression of both human J chain and human secretory component, as well as 
kanamycin resistance, was introduced into A. tumefaciens strain LBA4404. The chimeric 
ICAM/IgA molecule in another binary vector was also used to transform LBA4404. 
Overnight cultures of both strains were used for simultaneous "co-cultivation" with leaf 
pieces of tobacco, according to a standard protocol (Horsch et aL, Science 227:1229-1231, 
1985). 

A standard protocol for regeneration ofboth bombarded and Agrobacterium- 
transformed tobacco leaf disks was used (Horsch et al, Science 227:1229-1231, 1985> 
Because transformed plants, regenerated from bombarded tissue, frequently undergo gene- 
silencing upon maturation, transgenic tobacco plants were prepared via Agrobacterium' • 
mediated transformation, which gives a higher yield of expressing, mature plants. - 

3. Purification of Assembled Immunoadhesin 

The immunoadhesin expressed according to Examples 3 was purified. Galli were 
grown in large amounts to facilitate the development of extraction procedures. A partial 
purification schedule provided a stable concentrate, available in a variety of buffer conditions, 
for investigation of subsequent chromatographic techniques for the further purification of the 
immunoadhesin (See FIG. 3). Calli were extracted in a juicer, which crushes tissue between 
two stainless-steel gears, while bathed in a buffer containing sodium citrate (0.6 M, pH 7.4) 
and urea (final concentration of 2 M). The juice (~1 ml/g fresh weight) was precipitated, after 
coarse filtration through cheesecloth, with 0.67 volumes of saturated ammonium sulfate. A 
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green pellet was collected after centrifugation and thoroughly extracted, in a small volume of 
50 mM sodium citrate (pH 6.6), with a Dounce homogenizes After additional centrifugation, 
a clear brown supernatant was collected and partially purified, during buffer exchange in a 
de-salting mode, by passage through a Sephadex G-100 column. The desalting/buffer 
5 exchange step has allowed preparation of a partially purified concentrate (-0.2 ml/ g fresh 
weight callus) in a desirable buffer, the G-100 column was eluted with 0.25 X phosphate 
buffered saline. This eluate appeared to be stable for at least 10 days at 2-8°C. 

4. The Immunoadhesin Inhibits Human Rhinovirus Infectivitv 

The infectivity of cells by human rhinovirus was demonstrated to be inhibited by the 

10 immunoadhesin prepared according to Example 3. Callus extract prepared according to 
Example 3 successfully competed for binding of an anti-ICAM monoclonal antibody to 
soluble ICAM-L FIG. 4 shows the data from an enzyme-linked immunosorbent assay 
(ELISA). For the assay, 96-well plates were coated with 0.25 [ig soluble ICAM-l/ml. The 
squares represent the increasing concentrations of sICAM and the circles represent the 

15 increasing amounts of callus extract (sterile filtered fraction from G-10Q) used to .compete 
with the adhered ICAM for a constant amount of a mouse (anti-human IC AM) antibody. 
After washing the wells, adherent mouse antibody was detected with an anti-mouse antibody _ 
conjugated to horseradish peroxidase. Adherent enzyme activity was measured at 490 nm, 
with ortho-phenylene diamine as a substrate. The data (squares, sICAM; circles, Extract) are . 

20 well described by sigmoids of the form OD490 = y =y 0 + a/[l+e A -{(x-x 0 )/b}], where a = y 
max, yo = y min, b = the slope of the rapidly changing portion of the curve and xo = the value 
of x at the 50% response level. Relative to soluble ICAM-1, the immunoadhesin extract 
tested here contains the equivalent of -250 fig ICAM/ml; this is an overestimate due to 
expected avidity effects of the dimeric and tetrameric assemblies of the ICAM-l-heavy-chain 

25 fusions. Thus, this ELISA demonstrated that the immunoadhesin competes with soluble 
ICAM-1 for binding to an anti-ICAM mAb. 

The competitive ELISA allows for quantitative assessment of the recovery of activity 

by comparing the normalized amounts of various fractions required to give a 50% response. 
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Upon purification, the titer of a immunoadhesin preparation may be expressed as a reciprocal 
dilution, or the number of milliliters to which a milligram of immunoadhesin must be diluted 
in order to give a 50 % response. This ELISA will facilitate the development of a purification 
process for the immunoadhesin. 

A cytopathic effect assay (CPE) demonstrated the specific ability of the partially 
purified immunoadhesin to inhibit the infectivity of human cells by human rhmovirus (FIG. 
5). Rhinovirus serotype HRV-39 was pre-incubated with human ICAM-1, an ICAM/IgA 
fusion (gift of Dr. Tim Springer), or with extracts from calli either expressing our 
ICAM-l/SIgA immunoadhesin or another, different, antibody before plating each of the 
mixtures with HeLa S3 cells at 33°C. After 3 days/viable cells were fixed and stained with a 
methanolic solution of Crystal Violet; the optical density at 570 nm provides a proportional . 
measure of cell viability. 

Two extracts derived from Rhil07-1 1, containing the immunoadhesin, clearly 
inhibited the virus' ability to infect and kill HeLa S3 cells (FIG. 5A, right side-up and 
upside-down triangles). Because the extracts were only partially purified, we also assayed a 
similarly prepared extract that contained a human IgA2m(2) directed against Doxorubicin, a 
chemotherapeutic agent. That extract, containing a similar immunoglobulin with an unrelated 
binding specificity, was unable to inhibit the infectivity of the rhinovirus and demonstrates : ... 
that expression of the ICAM-1 -heavy-chain fusion confers specificity to the inhibition. The 
CPE assay demonstrated, as expected, the differential ability of souluble ICAM-1 and an 
(ICl-5/TgA; Martin, e/ gl, 1993) to inhibit viral infectivity (FIG. 5B). The insert in Figure 
56 is the scale expansion in the range of the IC50 for soluble ICAM-1 (1.35 ug/ml) and for 
the ICI-5/IgA (0.12 ug/ml; 1 1 .3 fold less). 

5 - Production and Purification of Tmnwn oadhesins for Clinical and TnviVnlngfo fll 
Studies 6 — • 

Production of sufficient immunoadhesin for the proposed clinical and toxicological 
needs is performed by making transgenic tobacco plants. The transgenic plants which express 
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the immunoadhesin (without an ER retention signal) are generated by 
Agrobacterium-meAitsteA. transformation. The absence of an ER retention signal is anticipated 
to enhance assembly since the nascent SIgA is processed through the entire Golgi apparatus, 
including, in particular, the trans-Golgi, where SC is covalently linked to dlgA as suggested 
by pulse-chase experiments (Ghintalacharuvu & Morrison, Immunotechnology 4: 1 65-1 74, 
1999). Because Agrobacterium-mediated transformation is much more likely to generate 
plants with consistent levels of transgene expression, it is likely that progeny of these plants 
will be used for the production of clinical grade immunoadhesin. 

In order to maximize expression levels, and create a true-breeding line, it is desirable 
to create homozygous plants. The highest producing plants (generation T 0 ) can self-fertilize 
in the greenhouse before seed is collected. One quarter of the Ti plants are expected to be 
homozygous. These are grown in the greenhouse and seed samples fiom several plants are 
separately germinated on medium containing kanamycin. All the progeny (T 2 ) from 
homozygous positive plants are expected to be green. Some of the progeny of heterozygous 
plants are expected to be white or yellowish. Homozygosity is confirmed by back-crossing to 
wild-type and hnmurioblotting extracts of the progeny. 

Harvesting and processing may be continuously meshed during a production 
campaign, especially since multiple harvests may be obtained from a single planting, i.e. 
plants cut to soil level for one harvest are regrown for subsequent harvests. In developing a 
sense of scale for the production of immunoadhesin it is necessary to decide on the required 
amount of finished immunoadhesin, account for expression levels (mg immunoadhesin 
present/ kg fresh weight tobacco), know the growth rate of the plants and the expected weight 
of the average plant, and the overall yield of the purification schedule (set at 20%). Setting 
the overall need at 3 g of finished immunoadhesin requires preparing for 4 harvests, each with 
an expected yield of 1 g of finished immunoadhesin. 

Given these targets and parameters, the necessary number of plants and hence the 
space requirements for plant growth is determined. FIG. 6 shows an evaluation of the 
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production necessities for making 1 gram of finished Immunoadhesin. In this diagram, the 
number of plants needed for 1 g of immunoadhesin, at 20% yield, at expected levels of 
expression and plant weight is illustrated. At different levels of immunoadhesin expression 
(mg/kg fresh weight) and overall recovery (set at 20%), the weight of each plant, and so the 
5 total number of plants, may be determined for a specified production target (1 g/harvest) 
within a window (dotted square) of reasonable possibilities. The number of required plants 
decreases, inversely, with the number of specified growth and re-growth periods. The 
expected biomass production, a function of time and growth conditions, influences the time to 
harvest arid the time between harvests. These growth periods can be adjusted to the realities 

10 of the purification schedule by staggering planting and harvesting dates. From our 
experience, production requires x number of plants. For example, 1 g of finished 
immunoadhesin from plants with a reasonable expression level, of 1 00 mg of 
immunoadhesin/kg fresh weight, require 250 plants when harvested at a weight of 200 g/plant 
(-80 days post germination). At this scale, these plants require about 10 m 2 of growing space 

1 5 and are harvested twice over 1 50 days. 

Processing 50+ kg of biomass at a time requires several moderately large-scale 
operations which all have counter-parts in the food-processing industry. These includebulk . 
materials handling, size reduction, juicing and filtration. A Vincent Press and a Durco . 
filtration system are used to efficiently process these quantities. The juicing step employs a 
20 proven and simple buffer of sodium citrate and urea* These components buffer the extract, 
help prevent the oxidation of phenolics and their association with proteins (Gegenheimer, 
Methods in Enzymology 182:174-193, 1990; Loomis, Methods in Enzymology, 31:528-544, 
1974; Van Sumere, et al, The Chemistry and Biochemistry of Plant Proteins, 1975.) and 
ensure the solubility of the immunoadhesin during a subsequent acid precipitation. 

25 Filtration of acid-insoluble lipid and protein (-90% of the total) is followed by 

tangential flow ultrafiltration to concentrate the immunoadhesin and to remove small proteins, 
especially phenolics. Diafiltration enhances the removal of small molecules and exchanges 
the buffer in preparation for short-term storage and subsequent chromatography. Either 
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SP-Sepharose (binding at pH 5.0 or below) or Q-Sepharose (binding at pH 5.5 or above) are 
among the ion-exchanges that can be used for filtering immunoadhesin. They are readily 
available, scalable, robust and have high capacities. In particular, they are available for 
expanded-bed formats, which reduce the stringency of prior filtration steps. Cation-exchange 
chromatography, which can be more selective than anion-exchange chromatography, is used 
first. The immunoadhesin is purified from the several species of protein potentially present, 
to the point where at least 95% of the protein is in the form of ICAM-l/IgA, ICAM-l/dlgA or 
ICAM-l/SIgA, as the presence of di- and tetra-valent ICAM-1 domains are critical for potent 
anti-viral activity. Purified inimunoadhesin is then tested for acceptable levels of endotoxin, 
alkaloids such as nicotine and for bio-burden. In addition, potency levels (defined by ELBA 
and CPE assays), protein concentration, pH and appearance are monitored. Subsequently, the 
stability of the clinical lots of mimunoadhesin is determined, to ensure that patients receive 
fully potent immunoadhesin. Even partially purified extracts have been found to be stable for 
10 days when refrigerated. The titer and potency of clinically formulated immunoadhesin (in 
phosphate-buffered saline), when stored at -20°C, 2-8°C s and at 37°C, over a period of 3 to 6 
months, is also tested. 

6 - The Immunoadhesins Have Plant-Specific Glycosvlation 

The immunoadhesins produced are analyzed to determine the pattern of glycosylation 
present Cabanes-Macheteau et al.,Glycobiology 9(4):365-372 (1999), demonstrated the 
presence of several glycosyl moieties, typical of plants, on a plant-expressed antibody 
construct Their methods are used to demonstrate that the immunoadhesins produced 
according to Example 1, 2 and 3 have a plant-specific glycosylation pattern. We anticipate 
that this diversity will also be a source of variability for immunoadhesin. Since crude extracts 
have been shown to have anti-viral activity in vitro (data not shown), glycosylation, as such, 
does not appear to affect potency. N-linked glycosylation (FIG. 2 shows that there are fifteen 
potential sites on me chimeric ICAM-1 molecule alone) probably contributes to the diversity 
of bands seen in immuno-blots. Immunoadhesin preparations are digested with 
N-Glycosidase A, before blotting, showing that the difference in banding patterns collapse 
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into fewer, discrete bands. In this way, glycoforms are initially characterized with reducing 
and non-reducing polyacrylamide gels. In addition, digested and mock-digested fractions are 
tested in the CPE assay and competition ELISA, demonstrating the effect of N-linked 
glycosylation on potency and titer in vitro, 

7. The Immunoadhesin Inactivates Human Rhinovirus 

The immunoadhesin prepared according to Examples 1, 2 and 3 is assayed for its 
ability and to inactivate HRV by binding to the virus, blocking virus entry, and inducing the 
formation of empty virus capsids. To measure binding of the immunoadhesin to HRV, the 
immunoadhesin is incubated with [^leucine-Iabeled HRV-39 for 30 min and then added to 
HeLa cells for 1 hr. After washing, cells and bound virus are detached with Triton X-100 and 
I 3 !!] measured in a scintillation counter. 

mactivation of HRV-39 by incubation with the immunoadhesin is compared with 
HRV inactivation by sICAM-1. HRV-39 is not directly inactivated to a significant extent 
(<0.5 logio reduction in infectivity) by incubation with monomeric sICAM-1, while 
incubation with ICl-5D/IgA reduced infectivity approximately 1.0 logw (Arfuda, et al, : " 
Antimicrob. Agents Chemother. 36:1186-1191, 1992; Crump, etal, Antimicrob. Agents 
Chemother. 38: 1425-7, 1994). In order to test the ability of the irnmunoadhesin to inactivate' " 
HRV-39^ 10 6 50% tissue culture infective doses (TCID50) of HRV-39 are incubatedin • : 
medium containing a concentration of sICAM-1 or immunoadhesin equal to ten times the IC 50 
of each molecule for that virus, or in plain medium, for 1 hr at 33°C on a rocker platform. 
Each virus-immunoadhesin or virus-medium mixture are then diluted serially in ten-fold 
dilutions, and the titer determined on HeLa cells in 96-well plates. 

The effect of the immunoadhesin on HRV attachment to host cells is tested by 
inoculating HeLa cells with HRV-39 at a MOI of 0.3 in the presence or absence of the 
immunoadhesin. Absorbance proceeds for one hour at 4°C, the cells are washed, and media is 
replaced plus or minus the immunoadhesin. Cells are incubated for ten hours at 33°C (to 
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allow one round of replication), and virus are harvested by freeze/thawing the cells. The virus 
is titered on HeLa cells. 

ICAM-IgA (ICl-5D/IgA) is more efficient than sICAM-1 at inducing conformational 
changes in HRV, leading to the formation of empty, non-infectious viral particles (Martin, et 
5 al J. Virol 67:3561-8, 1993). To examine the ability of the immunoadhesin produced 

according to Examples 1, 2 and 3 to induce conformational changes in HRV, causing release 
of viral RNA, purified immunoadhesin is incubated with [ 3 H]leucine-labeled HRV-39 for 30 
min and then the virus is overlayed onto a 5 to 30% sucrose gradient Following 
centrifugation for 90 min at 40,000 rpm, fractions are collected, [ 3 H] measured, and fractions 
10 assessed for infectivity. (Intact HRV sediments at 149S on a sucrose gradient while empty 
capsids lacking RNA sediments at 75S (Martin, et al J. Virol 67:3561-8, 1993)). Due to its 
increased valence, we expect the ICAM/SIgA immunoadhesin is more efficient at inducing 
empty non-infectious particles than ICAM-IgA. 

The inhibitory effect of purified immunoadhesin on a panel of both major and minor 
15 (that do not use ICAM-1 as a receptor) HRV serotypes will be examined using the CPE assay. 
The ability of ICAM-1 to inhibit HRV infection varies among Viral isolates. It has been 
shown (Crump, et al, Antimicrbb. Agents Chemother. 38:1425-7, 1994) that the EC50 for 1 
sICAM-1 varies from 0.6 ng/ml to >32 \ig/ml when tested on a panel of HRV major receptor 
serotypes assay using HeLa cells. Our panel includes nine major serotypes (HRV-3 ~ -13, -14, 
20 -16, -23, -39, -68, -73, and -80) and the minor receptor serotype HRV-1 A. 

8. Clinical Studies Demonstrating the Ability of the Immunoadhesin to Reduce 
Infectivity in Humans: Dose Escalation Tolerance Study 

The immunoadhesin of the present invention is tested in two randomized controlled 
trials to determine the effect of intranasal administration of the immunoadhesin on infection, 
25 11^8 response, and illness in experimental rhinovirus colds. These two studies evaluate the 
immunoadhesin taken by subjects before or after rhinovirus inoculation. The clinical 
protocols used here are based on protocols previously used by in evaluation of a recombinant 



60 



WO 01/83529 



PCT/US01/13932 



soluble ICAM-1 molecule for efficacy against rhinovirus infection (Turner, et al, JAMA 
281:1797-804, 1999). 

A. Subjects. 

Subjects are recruited from university communities at the University of Virginia, 
5 Charlottesville. Subjects are required to be in good health, non-smokers, and between the 
ages of 18 and 60 years. Subjects are excluded if they have a history of allergic disease or 
nonallergic rhinitis, abnormal nasal anatomy or mucosa, or a respiratory tract infection in the 
previous 2 weeks. Pregnant or lactating women or women not taking medically approved 
birth control are also excluded. In the experimental virus challenge study (Phase I/n, see 
10 below), subjects are required to be susceptible to the study virus as evidenced by a serum 

neutralizing antibody titer of 1 :4 or less to the virus, determined within 90 days of the start of 
the trial. 

B. Study Medication. 

The immunoadhesin of the present invention is formulated as a phosphate-buffered 
15 saline (PBS) spray solution containing 2.6 mg/ml; The placebo consists of PBS and is 
identical in appearance to the active preparation. The solutions are administered using a 
medication bottle equipped with a metered nasal spray pump. The pump delivers 70 ^1 of 
solution containing 183 jig of the immunoadhesin with each spray. The medication is 
administered as two sprays per nostril, six times daily (at 3-hour intervals) for a total daily 
20 dose of 4.4 mg. This is the same dose, in mg protein/day, as was used for soluble ICAM-1 in 
the tremacamra study infection (Turner, et al, JAMA 281:1797-804, 1999). A mole of the 
immunoadhesin has about twice the mass as a mole of sICAM-1 . However, given the 
differences in in vitro activity between sICAM-1 and ICAM/IgA fusions, the immunoadhesin 
is many fold more effective on a molar basis than sICAM-1 . Thus, this amount is a 
25 conservative calculation of what is necessary. This amount is used, except in the event that 
the dose escalation study reveals problems at this dose. 
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C. Study Design 

Single ascending dose and multiple ascending dose studies are used to evaluate the 
safety of the immunoadhesin. In each case, three subjects are evaluated at each dosage level, 
two receiving the immunoadhesin and one receiving placebo. In the single ascending dose 
5 study, four dosage levels are evaluated. The lowest individual dose is half the anticipated 
dose to be used in the challenge study, and the highest individual dose is twice the anticipated 
challenge study dose. The dosage levels are as follows: one spray in each nostril (366 jig 
total), two sprays in each nostril (732 ng total), three sprays in each nostril (1098 \ig total), 
four sprays in each nostril (1464 \ig total). 

10 The same dosage levels are used in the multiple ascending dose study. Subjects , 

receive doses every three hours (six times per day) for five days. In both studies subjects are 
evaluated at each dosage level, staggering the start of each subsequent level until it is clear 
that there is no acute toxicity at the previous level. All subjects return for a single dose 21 
days after the first dose, and then for a follow-up at six weeks (for determination of serum 

15 antibody against the immunoadhesin). 

A separate group of twelve subj ects is given one dose of two sprays in each nostril 
(732 \ig total), and nasal lavage is done at 1, 2, 4, 8 and 16 hours (two subjects at each time 
point). Washings are assayed at Panorama Research by ELIS A for the immunoadhesin in 
order to calculate its in vivo half-life. The total amount of the immunoadhesdn to be used in 
20 the dose escalation and half-life determination studies (on a total of 28 subjects) will be 
approximately 270 mg. 

D. Safety Evaluations. 

In addition to routine adverse event recording, the safety of the immunoadhesin is 
assessed in three ways. First, prior to the first dose and after the last dose the investigators 
25 perform a visual examination of the nasal mucosa, in particular looking for signs of irritation 
or inflammation. Any visible changes are noted. Second, standard blood safety evaluations 
are done on samples collected prior to treatment and after the last dose on study days 1, 4, and 
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8 (and 21 in the multiple ascending dose study). Third, serum samples are saved, frozen, and 
used to determine if the immunoadhesin is able to pass through the nasal mucosa into the 
blood. This is accomplished in two ways. First, the presence the immunoadhesin in serum 
samples is measured by ELISA. ]h this assay, anti-human IgA antibodies adsorbed to 
microtiter plates capture any the immunoadhesin in the serum, which are detected by an 
anti-ICAM antibody. The sensitivity of the assay is determined using normal human serum 
samples spiked with known concentrations of the immunoadhesin. Alternatively, anti-ICAM 
antibodies can be adsorbed to plates to capture the immunoadhesin in the serum, that would 
be detected by anti-IgA. Second, the presence of an immune response to the immunoadhesin 
is assayed with an ELISA method that uses the immunoadhesin adsorbed to microtiter plates. 
Any anti-immunoadhesin antibodies in the serum bind, and are detected with anti-human IgG 
or anti-human IgM. Pre-treatment and post-treatment serum samples are compared, and any 
change in titer is considered evidence of uptake of the immunoadhesin, If there is any 
positive evidence of anti-immunoadhesin antibodies, additional assays will be done to 
distinguish between anti-ICAM-1 and anti-IgA activity. 

Patients are screened for the development of an allergic reaction to the 
immunoadhesin. (In previous studies, there were no episodes of adverse reactions with 
soluble ICAM applied topically in the nose or plantibodies applied topically in the oral 
cavity.) Individuals exhibiting symptoms of nasal allergy are tested for 
anti-immunoadhesin-specific IgE antibodies in nasal lavage fluids using a sensitive two-step 
ELISA (R&D Systems). 

E. Statistical Analysis. 

The sample size for these studies is based on previous studies using the rhinovirus 
challenge model. The sample size planned for the protection studies should be adequate to 
detect a reduction in the incidence of clinical colds from 75% in the placebo groups to 25% in 
the active treatment groups at 1-sided levels of a = .05 and 1=0 - .80. In addition, the sample 
size should be adequate to detect a change in the total symptom score of 5 units assuming an 
SD of 5.8 units. 
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9. Clinical Studies Demonstrating the Ability of the Immunoadhesin to Reduce 
Infectivitv in Humans: Challenge Studies 

Challenge studies are used to demonstrate that treatment with the immunoadhesin of 
the present invention protect against clinical colds or reduce cold symptoms after viral 
5 challenge. 

A. Challenge Virus. 

The challenge virus used for this study is rhinovirus 39 (HRV-39). Rhinovirus 
type 39 is a major group of rhinovirus that requires ICAM-1 for attachment to cells.* The 
challenge virus pool is safety-tested according to consensus guidelines (Gwaltney, et al, 
10 Prog. Med. Virol 39:256-263, 1992). All subjects are inoculated with approximately 200 
median tissue culture infective dose (TCID50). The virus are administered as drops in two 
inocula of 250 jil per nostril given approximately 15 minutes apart while the subjects are 
supine. , 
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B. Study Design. 



Two randomized rhinovirus challenge studies are performed (see Table 1). The same 
formulation of the immunoadhesin of the present invention is evaluated in pre-inoculation and 
post-inoculation studies. In both studies, medication is administered as six doses each day for 
five days. Subjects are randomly assigned to receive either the irnmunoadhesin or matching 
placebo at the time of enrollment into each study. The study is blinded and all clinical trial 
personnel, subjects, and employees of Panorama Research remain blinded until all data are 
collected. 

In the pre-inoculation study, medications are started four hours (two doses) prior to 
viral challenge. The virus challenge is administered one hour after the second dose of the 
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immunoadhesin (or placebo) and the four remaining doses of study medication for the first 
day are given as scheduled. In this study eighteen subjects receive the active treatment and 
eighteen subjects receive placebo. 

In the post-inoculation study, medications begin 24 hours after virus challenge. This 
5 timepoint was chosen because it has been used in other studies of protection from virus 
challenge, and because cold symptoms are clearly present (Harris & Gwaltney, Clin. Infect. 
Dis. 23:1287-90, 1996). Virus challenge in this study is administered in the morning of study 
day 0 approximately 24 hours prior to the first dose of study medication on the morning of 
study day 1. In this study, 36 subjects receive the active treatment and 18 subjects receive 
10 placebo. 

Subjects are isolated in individual hotel rooms from study day 0 (the day of virus 
challenge) to study day 6. On each of these days a symptom score and a nasal lavage for 
virus isolation are done in the morning prior to the first dose of medication and a second 
symptom score is done each evening. On study day 6, subjects are released from isolation but 
15 continue to record symptom scores each evening through day 14. The subjects return'to' the 
study site on study day 21, when a final serum sample for detection of anti-immunoadhesin 
antibodies will be collected. The total amount of immunoadhesin to be used in the two virus 
challenge studies (on a total of 54 subjects) is approximately 1200 mg. 

C. Viral Isolation. 

20 Virus shedding is detected by virus isolation in cell culture. Nasal wash specimens are 

collected by instillation of 5 ml of 0.9% saline into each nostril. This wash is then expelled 
into a plastic cup and kept chilled for one to two hours until it is processed for viral cultures. 
Immunoadhesin is removed from the specimens by treatment with anti-ICAM-1 antibody 
adsorbed to an agarose support (Affi-Gel 10, Bio-Rad Laboratories, Hercules, CA). A portion 

25 of each processed specimen is stored at -80 °C, and another portion is inoculated into two 
tubes of HeLa-1 cells, a HeLa cell line enriched for the production of ICAM-1 Arruda, et al, 
J. Clin. Microb. 34:1277-1279, 1996). Rhinovirus are identified by the development of 
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typical cytopathic effect. Subjects with a positive viral culture on any of the postchallenge 
* study days are considered infected. Viral titers in the specimens stored at -80 °C are 
determined by culturing serial ten-fold dilutions in microtiter plates of HeLa-1 cells. 

Antibody to the challenge virus are detected by serum neutralizing titers done using 
standard methods Gwaltney, et al, Diagnostic Procedures for Viral Rickettsial and 
Chlamydial Infections, p. 579-614, American Public Health Association). Serum specimens 
for antibody testing are collected during screening, immediately prior to virus challenge 
(acute), and again 21 days later (convalescent). Subjects with at least a four-fold rise in 
antibody titer to the challenge virus when the convalescent serum sample is compared with 
the acute serum sample are considered infected. 

D, Evaluation of Illness Severity 

Illness severity is assessed as previously described (Turner, et al, JAMA 
281:1797-804, 1999). Symptom scores are recorded prior to virus challenge (baseline) and 
twice each day at approximately twelve-hour intervals for the next 6 days. On studjrdays 7 ' 
through 14 each subject records his/her symptom score once per day in the evening. At each 
evaluation, subjects are asked to judge the maximum severity of the following.eight 
symptoms in the interval since the last symptom evaluation: sneezing, rhinorrhea, nasal 
obstruction, sore throat, cough, headache, malaise, and chilliness. Each symptom is assigned 
a severity score of 0 to 3 corresponding to a report of symptom severity of absent, mild, 
moderate, or severe. If symptoms are present at baseline, the baseline symptom score will be 
subtracted from the reported symptom score. The higher of the two daily evaluations are 
taken as the daily symptom score for each symptom. The daily symptom scores for the eight 
individual symptoms are summed to yield the total daily symptom score. The total daily 
symptom scores for the first 5 days after virus challenge (study days 1-5) are summed and on 
the evening of study day 5, all subjects are asked, "Do you feel you have had a cold?" 
Subjects who had a total symptom score of at least 6 and either at least three days of 
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rhinorrhea or the subjective impression that they had a cold are defined as having a clinical 
cold. 

The weight of expelled nasal secretions is determined on days 1-7 by providing all 
subjects with packets of preweighed nasal tissues. After the tissues are used they are stored in 
5 an airtight plastic bag. Each morning the used tissues, together with any unused tissues from 
the original packet, are collected and weighed, 

E. IL-8 Assay. 

Recent studies have suggested that the host inflammatory response, particularly 
interleuMn 8 (IL-8), may play a role in the pathogenesis of common cold symptoms due to 
10 rhinovirus infection. Concentrations of EL-8 in nasal lavage are determined with a 

commercially available ELISA (R&D Systems, Minneapolis, Minn) as previously described 
(Turner, etal, JAMA 281:1797-804, 1999). 

F. Safety Evaluations. 

The same evaluations are done in the challenge study as in the dose escalation study 
15 described in Example 8. 

G. Statistical Analysis. 

Statistical analysis is performed similarly as to that described for the dose escalation 
study described in Example 8. 
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Claims 

1. An immunoadhesin comprising a chimeric ICAM-1 molecule, said chimeric ICAM-1 
molecule comprising: 

a rhinovirus receptor protein linked to at least a portion of an immunoglobulin heavy chain; 
and 

J chain and secretory component associated with said chimeric ICAM-1 molecule. 

2. The immunoadhesin of claim 1 wherein said rhinovirus receptor protein is comprised of 

any combination of extracellular domains 1 , 2, 3, 4 and 5 oflCAM- 1 . 

3. The immunoadhesin of claim 1 wherein said immunoglobulin is selected from the group of 
IgA, IgAi, IgA2, IgM, and chimeric immunoglobulin heavy chains. 

4. The immunoadhesin of claim 1 further comprising at least one additional chimeric ICAM- , 
1 molecule. 

5. The immunoadhesin of claim 1 wherein said rhinovirus receptor protein is comprised of 
any combination of extracellular domains 1, 2, 3, 4 and 5 of ICAM-1; and said 
immunoglobulin heavy chain comprises at least a portion of an IgA 2 heavy chain! 

6. The immunoadhesin of claim 1 expressed in transgenic plants. 

7. The immunoadhesin of claim I expressed in monocotyledonous plants. 

8. Hie immunoadhesin of claim 1 expressed in dicotyledonous plants. 

9. The immunoadhesin of claim 1 wherein all proteins are human. 

10. The immunoadhesin of claim 1 expressed in heterologous cells derived from plants, 
vertebrates, or invertebrates. 
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11- The immunoadhesin of claim 1 expressed in mammalian cells. 

12. The immunoadhesin of claim 1 expressed in hairy root cultures. 

13. The immunoadhesin of claim 1 expressed in plant cells in tissue culture. 

14. An immunoadhesin comprising a chimeric ICAM-1 molecule, said chimeric ICAM-1 
5 molecule comprising: a rhino virus receptor protein linked to at least a portion of an 

immunoglobulin heavy chain, wherein immunoadhesin has plant-specific glycosylation. 

15. The immunoadhesin of claim 14 wherein said immunoadhesin further comprises a J chain 
and secretory component associated with said chimeric ICAM-1 molecule. 

1 6. The immunoadhesin of claim 14 wherein said rhinovirus receptor protein is comprised of 
10 any combination of extracellular domains 1,2, 3, 4 and 5 of ICAM-1. 

17. The immunoadhesin of claim 14 wherein said immunoglobulin heavy chain is selected 
from the group of IgA, IgAi , IgA 2 , lgG u IgG 2 , IgG 3 , IgG 4 , IgM, IgD, IgE, and a chimeric 
immunoglobulin heavy chain. 

1 8 . The immunoadhesin of claim 1 4 further comprising at least one additional chimeric 
15 ICAM-1 molecule. . . 

19. The immunoadhesin of claim 14 wherein said rhinovirus receptor protein is comprised of 
any combination of extracellular domains 1, 2, 3, 4 and 5 of ICAM-1 ; and said 
immunoglobulin heavy chain comprises at least a portion of an IgA 2 heavy chain. 

20. The immunoadhesin of claim 14 wherein all proteins are human. 

20 21 . The immunoadhesin of claim 14 expressed in heterologous cells derived from plants. 

22. The immunoadhesin of claim 14 expressed in hairy root cultures. 

23 . The immunoadhesin of claim 14 expressed in plant cells in tissue culture. 
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24. The immunoadhesin of claim 14 expressed in plants. 

25. The immunoadhesin of claim 14 expressed in monocotyledonous plants. 

26. The immunoadhesin of claim 14 expressed in dicotyledonous plants. 

27. A composition comprising an immunoadhesin and plant material, wherein 

said immunoadhesin comprises a chimeric ICAM-1 molecule, said chimeric ICAM-1 
molecule comprising a rhinovirus receptor protein linked to at least a portion of an 
immunoglobulin heavy chain. 

28. The composition of claim 27 further comprising a J chain and secretory component 
associated with said chimeric ICAM-1 molecule. 

29. A composition of claim 27 wherein said rhinovirus receptor protein is comprised of any 
combination of extracellular domains 1, 2, 3, 4 and 5 of ICAM-1; and said immunoadhesin 
has plant-specific glycosylation. . 

30. A composition of claim 27 wherein said immunoglobulin is selected from the group of * 
IgA, IgA b IgA 2 > IgGr, IgG 2 , IgG 3 , lgG 4 , IgM, IgD, IgE, and a chimeric inmiunogfobulin 
heavy chain. 

31. A composition of claim 27 further comprising at least one additional chimeric ICAM-1 
molecule. 

32. A composition of claim 27 wherein said rhinovirus receptor protein is comprised of 
any combination of extracellular domains 1, 2, 3, 4 and 5 of ICAM-1; and 

said immunoglobulin heavy chain is an IgA 2 heavy chain 

33. A method for reducing the infection by human rhinovirus of host cells susceptible to 
infection by human rhinovirus, said method comprising: 
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contacting the virus with an immunoadhesin of claim 1, 14, or 27, and wherein 
said immunoadhesin binds to human rhinovirus and reduces infectivity thereof. 

34. A method for reducing the initiation or spread of the common cold due to human 
rhinovirus, said method comprising: 

contacting the virus with an immunoadhesin of claim 1, 14 or 27, and wherein 
said immunoadhesin binds to human rhinovirus and reduces infectivity thereof. 

35. A method for the treatment or prevention of human rhinovirus infection in a human 
subject, said method comprising: 

aoininistering to said subject an effective amount of an immunoadhesin of claim 1, 14or27, 
and wherein said immunoadhesin reduces human rhinovirus infectivity thereof. 

36. A method for the treatment or prevention of human rhinovirus infection in a subject, said 
method comprising: 1 



intranasally administering to said subject an effective amount of an immunoadhesin of claim 
1, 14 or 27, and wherein said immunoadhesin reduces human rhinovirus infectivity thereof. 

37. A method for the treatment or prevention of human rhinovirus infection in a subject, said 
method comprising: a(lministering through the oral cavity to said subject an effective amount 
of an immunoadhesin of claim 1, 14 or 27, and wherein said immunoadhesin reduces human 
rhinovirus infectivity thereof. 

38. A pharmaceutical composition comprising an immunoadhesin of claims 1, 14, or 27 in a 
pharmaceutically acceptable buffer. 

39. An expression vector comprising a gene encoding a chimeric ICAM-1 molecule 
operatively linked to aplant promoter, said chimeric ICAM-1 molecule comprising a 
rhinovirus receptor protein linked to at least a portion of an immunoglobulin heavy chain, 
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FIGURE 6 
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FIGURE 7 A 



I. HUMAN IG ALPHA-1 CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

>sp|P01876|ALCl_HUMAN IG ALPHA-1 CHAIN C REGION - Homo sapiens (Human) . 



10 20 30 40 50 60 

I I i I I I 

ASPTSPKVFP LSLCSTQPDG NWIACLVQG FFPQEPLSVT WSESGQG VTA RNFPPSQDAS 

70 80 90 • 100 110 120 

I I I I I ) 

GDLYTTSSQL TI,PATQCLAG KSVTCHVKHY TOPSQDVTVP CPVPSTPPTP SPSTPPTPSP 



130 140 150 160 170 180 

I I ( ( I ( 

SCCHPRLSLH RPALEDLLLG SEANLTCTLT GLRDASGVTF TWTPSSGKSA VQGPPERDIiC 



190 200 210 220 230 240 

I I I I I I 

GCYSVSSVLP GCAEPWNHGK TFTCTAAYPE SKTPLTATLS KSGNTFRPEV HLLPPPSEEL 

250 260 270 280 290 300 

I I ! I I I 

AIiNELVTIiTC LARGFSPKDV LVRWIiQGSQE LPREKYLTWA SRQEPSQGTT TFAVTSILRV 



310 



320 



330 



340 



350 



AAEDWKKGDT FSCMVGHEAL PLAFTQKTID RLAGKPTHVN VSWMAEVBG TCY 



CODING SEQUENCE 



catccccgac 
acgtggtcat 
ggagcgaaag 
gggacctgta 
agtccgtgac 
gcccagttcc 
catgccgcca 
cagaagcgaa 
cctggacgcc 
gctgctacag 
ccttcacttg 
aatccggaaa 
ccctgaacga 
tggttcgctg 
cccggcagga 
cagccgagga 
cgctggcctt 
tgtctgttgt 



cagccccaag 
cgcctgcctg 
cggacagggc 
caccacgagc 
atgccacgtg 
ctcaactcca 
cccccgactg 
cctcacgtgc 
ctcaagtggg 
cgtgtccagt 
cactgctgcc 
cacattccgg 
gctggtgacg 
gctgcagggg 
gcccagccag 
ctggaagaag 
cacacagaag 
catggcggag 



gtcttcccgc 
gtccagggct 
gtgaccgcca 
agccagctga 
aagcactaca 
cctaccccat 
tcactgcacc 
acactgaccg 
aagagcgctg 
gtcctgccgg 
taccccgagt 
cccgaggtcc 
ctgacgtgcc 
tcacaggagc 
ggcaccacca 
ggggacacct 
accatcgacc 
gtggacggca 



tgagcctctg 
tcttccccca 
gaaacttccc 
ccctgccggc 
cgaatcccag 
ctccctcaac 
gaccggccct 
gcctgagaga 
ttcaaggacc 
gctgtgccga 
ccaagacccc 
acctgctgcc 
tggcacgcgg 
tgccccgcga 
ccttcgctgt 
tctcctgcat 
gcttggcggg 
cctgctactg 



cagcacccag 
ggagccactc 
acccagccag 
cacacagtgc 
ccaggatgtg 
tccacctacc 
cgaggacctg 
tgcctcaggt 
acctgagcgt 
gccatggaac 
gctaaccgcc 
gccgccgtcg 
cttcagcccc 
gaagtacctg 
gaccagcata 
ggtgggccac 
taaacccacc 



ccagatggga 
agtgtgacct 
gatgcctccg 
ctagccggca 
actgtgccct 
ccatctccct 
ctcttaggtt 
gtcaccttca 
gacctctgtg 
catgggaaga 
accctctcaa 
gaggagctgg 
aaggacgtgc 
acttgggcat 
ctgcgcgtgg 
gaggccctgc 
catgtcaatg 



-1 
60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1061 



GenBaxik . 
J00220 

LOOJS HUMIGCC8 2533 bp DNA 

DEFINITION Homo sapiens immunoglobulin 

(IGHA1) gene, partial cds. 

ACCESSION J00220 

VERSION J00220.1 GI:184743 
KEYWORDS 

SOURCE human . 



PRI 02-DEC-1998 
alpha- 1 heavy chain constant region 
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FIGURE 7 



B 



ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
COMMENT 



FEATURES 

source 



iritron 
exon 
gene 
CDS 



Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ■ 
Mammalia; Eutheria; Primates; Catzarrhini; Eominidae ; Homo 
1 (bases l to 2533) 

Takahashi,N., Ueda,s., obata,M., Nikaifiq,T., Nakai,s. and Honjo,T. 
Structure of human immunoglobulin gamma genes: Implications for' 
evolution of a gene family 
Cell 29, 671-679 (1982) 
83001343 

This sequence is part of a multigene region containing the 
immunoglobulin heavy chain gamma- 3, garama-1, pseudo-epsilon arid 
alpha- 1 genes. 

Location/Qualifiers 

1..2533 

/organ ism= "Homo sapiens" 
/db_xre f = » t axon i960€ tt 
/chromo some=s "14" 
/map=°14q32 ,33" 

/clone»»cosmid Igi3 ; Ig-gamma3-122' 1 
. /tissue_type= "placenta; liver" 
/gertnline 
<1..141 

/not e*> "aloha- 1 intron J-c n 
142.. 447 
/gene*= n IGHAl n 
<142..>1638 

/gene= " IGHA1 B 

join{<142. .447,662. .1021,1244. .1638) 
/gene= H IGHAl n 
/codon_start»3 

/product*" immunoglobulin alpha 
region" 

/pr ot e in^id= '» AAC8 2 5 2 8 . 1 11 
/db_xref ="GI : 184749" 

/translation^ "SPTSPKVFPLSLCSTQPrKSNWIAC^VQGFFPQEPLSVTWSESG 
QGVTARNFPPSQDASGDL YT7S S QLTLPATOCLAGKS VTCHVKHYTNPSODVTVPCPV 
PSTPPTPSPSTPPTPSPSCCHPRLSLHRPALEDLLLGSSANLTCTLtGLRDASGVTFT 
WTPSSGKSAVQGPPERDLCGCTSVSSVLPGC7^PWNHGKITTCTAAYPESKTPLTATL 

SKSGNTFRPEVHLLPPPSEELAIiNEIjVTT iTfTT.RPnFG Dm\7T ArDWT.rkrt eor«T 



heavy chain constant 



intron 

exbn 
intron 



BASE COUNT 

ORIGItf 

1 ggtccaactg 
61 tctgtgctgg 
121 acagtgcatt 
181 gcagcaccca 
241 aggagccact 
301 cacccagcca 
361 ccacacagtg 
421 gccaggatgt 
481 accccgtcgt 
541 actcacacca 



490 a 



448. .661 
/gene^iGHAi" 
/note="A" 

S62. .1021 

/gene^iGHAl" 
1022. .1243 
/genes "ighai » 
/notes'' B" 
1244 . .>1638 
/gene= n lGHAl» 

S66 c 753 g 



caggcctgtg 
gttcctccag 
ctgtgttcca 
gccagatggg 
cagtgtgacc 
ggatgcctcc 
cctagccggc 
gactgtgccc 
gccctgacac 
gagtggaccc 



gtgcaggagc 
tatagaggag 
gcatccccga 
aacgtggtca 
tggagcgaaa 
ggggacctgt 
aagtccgtga 
tgcccaggtc 
tgcgcctgca 
cgggccgagc 



424 t 

tgtgtgacca 
aggcagtata 
ccagccccaa 
tcgcctgcct 
gcggacaggg 
acaccacgag 
catgccacgt 
agagggcagg 
cccgtgttcc 
cccaggaggt 



t9999ctgtc 
gaggagaggg 
ggtcttcccg 
ggtccagggc 
cgtgaccgcc 
cagccagctg 
gaagcactac 
ctggggagtg 
ccacagggag 
99tggtggac 



accaggcctc 
ccgcgtcctc 
ctgagcctct 
ttcttccccc 
agaaacfctcc 
accctgccgg 
acgaatccca 

999cggggcc 

ccgccccttc 
aggccaggag 
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601 
661 
721 
731 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 

1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 



gggcgaggcg 
gttccctcaa 
tgccaccccc 
gcgaacctca 
acgccctcaa 
tacagcgtgt 
acttgcactg 

ggtgggtcca 

ggcctgcctc 
tgaggcaggc 
tcatcccctg 
gaggtccacc 
acgtgcctgg 
caggagctgc 
accaccacct 
gacaccttct 
atcgaccgct 
gacggcacct 
ccaagcagcc 
gggaaagggc 
ctggcacccc 
gtggtgcagg 
gatgggttct 
ttgtccctga 
caggctagcc 
tccagggccc 
tcaggacgtc 
acggggcacg 
gcggaacgtt 
cccaagctca 
atcaggcacc 
tcccatccct 
gtggcatcag 



ggggcatggg 
ctccacctac 
gactgtcact 
cgtgcacact 
gtgggaagag 
ccagtgtcct 
ctgcctaccc 
gaccctgctc 
ctccctccca 
tgaggggcca 
cccagecccc 
tgctgccgcc 
cacgcggctt 
cccgegagaa 
tcgctgtgac 
cctgcatggt 
tggcgggtaa 
gctactgagc 
ccacgcttcc 
cagggcatgg^ 
catgagactt 
aggcctcgct 
gagagacgca 
gagagccgcc 
aagaaaggcg 
cactgggagg 
ccactgacgc 
tgggcacagg 
cacagcccag 
gcetgctccc 
aactccacag 
fccctaagccc 
aag 



gaagtatgtg 
cccatctccc 
gcaccgaccg 
gaccggcctg 
cgctgttcaa 
gccgggctgt 
cgagtccaag 
ggggccctgc 
atcctgggct 
ggacacagcc 
tgacctggct 
gccgtcggag 
cagccccaag 
gtacctgact 
cagcatacbg 
gggccacgag 
acccacccat 
cgcccgcctg 
atccggcgcc 
acagggaaga 
tccaccctgg 
ggtgtgagat 
catgccccgg 
tctggctgta 
caggtagtgg 
acaagctcca 
atgcaggaag 
tgcacactsa 
acccagttcc 
ctaggcccct 
acccctccca 
aactaggacc 



ctgaccagct 
tcaactccac 
gccctcgagg 
agagatgcct 
ggaccacctg 
gccgagccac 
accccgctaa 
tcagtgctct 
ccagtgctca 
cggggfcgccc 
ctctaccctc 
gagctggccc 
gacgtgctgg 
tgggcatccc 
cgcgtggcag 
gccctgccgc 
gtcaatgtgt 
tccccacccc 
tgtctgtcca 
ataccccctg 
tgtgagtgtg 
cttaggtctg 
acagttatga 
gctgggaggg 
caggagcggc 
ggagggcccc 
gggcacctcc 
cactcacata 
agaaaagcca 
ctggcttccc 
ggcagcccct 
caaagcatag 



caggccatct 
ctaccccatc 
acctgctctt 
caggtgtcac 
agcctgacct 
ggaaccatgg 
ccgccaccct 
ggtttgcaaa 
tgccaagtac 
accagagcag 
caggaaacac 
tgaacgagct 
ttcgctggct 
ggcaggagcc 
ccgaggactg 
tggccttcac 
ctgttgtcat 
tgaataaact 
tcctcagggt 
ccctgagcct 
agttgtgagt 
ccaaggcagg 
gtgagcagtg 
aatagggagg 
gagggagtga 
accaccctag 
ccttaaccac 
tatgcctgag 

ggggagtccc 

tgtgtttcca 
gctccctgcc 
acagggaggg 



ctccactcca 
tccctcatgc 
aggtccagaa 
cttcacctgg 
ctgtggctgc 
gaagaccttc 
ctcaaaatcc 
gcatattcct 
acagggaaac 
aggggctctc 
attccggccc 
ggtgacgc tg 
gcaggggtca 
cagccagggc 
gaagaagggg 
acagaagacc 
ggcggaggtg 
. ccatgctccc 
ctcagcactt 
cggggggccc 
gtgagagtgt 
cacagcccag 
gcatggccgt 
gtaaaaggag 
ggggctggac 
tgggtgggcc^ 
actgctctgt 
ccctgcagga 
ctcccaagcc 
ctgtgcacag 
tggccaagtc 
gccacgtggg 



n. HUMAN IG ALPHA-2 CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

> S p|P01877 |ALC2_HUMAN IG ALPHA- 2 CHAIN C REGION - Homo sapiens (Human) . 

2 ° 20 30 40 50 60 

ASPTSPKVFP LSLDSTPQDG NWVACLVQG FFPQEPLSVT WSESGQNVTA RNFPPSQDAS 



70 



80 



90 



100 



110 



120 
I 



GDLYTTSSQL TLPATQCPDG KSVTCHVKHY TNPSQDVTVP CPVPPPPPCC HPRLSLHRPA 



130 



140 



150 



160 



170 



180 



LEDLLLGSEA NLTCTLTGLR DASGATFTWT PSSGKSAVQG PPERDLCGCY SVSSVLPGCA 



190 



200 



210 



220 



230 



240 



QPWNHGETFT CTAAHPELKT PLTANITKSG NTFRPEVHLL PPPSEELALN ELVTLTCLAR 



250 



260 



270 



280 



290 



300 



GPSPKDVLVR WLQGSQELPR EKYLTWASRQ EPSQ3TTTFA VTSILRVAAE DWKKGDTFSC 
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340 



MVGHEALPLA PTQKTIDRIA GKPTHVNVSV VMAEVDGTCY 



CODING SEQUENCE 



catccccgac 
acgtggtcgt 
ggagcgaaag 
gggacctgta 
agtccgtgac 
gcccagttcc 
tcgaggacct 
atgcctctgg 
cacctgagcg 
agccatggaa 
cactaaccgc 
cgccgccgtc 
gcfctcagccc 
agaagtacct 
tgaccagcat 
fcggtgggcca 
gtaaacccac 

ga 



cagccccaag 
cgcatgcctg 
cggacagaac 
caccacgagc 
atgccacgtg 
cccacctccc 
gctcttaggt 
tgccaccttc 
tgacctctgt 
ccabggggag 
caacatcaca 
ggaggagctg 
caaggatgtg 
gacttgggca 
actgcgcgtg 
cgaggccctg 
ccatgtcaat 



gtcttcccgc 
gbccagggct 
gtgaccgcca 
agccagctga 
aagcactaca 
ccatgctgcc 
tcagaagcga 
acctggacgc 
ggctgctaca 
accttcacct 
aaatccggaa 
gccctgaacg 
ctggttcgct 
tcccggcagg 
gcagccgagg 
ccgctggcct 
gtgtctgttg 



tgagcctcga 
tcttccccca 
gaaacctccc 
ccctgccggc 
cgaatcccag 
acccccgact 
acctcacgtg 
cctcaagtgg 
gcgtgtccag 
gcactgctgc 
acacattccg 
agctggtgac 
ggctgcaggg 
agcccagcca 
actggaagaa 
tcacacagaa 
tcatggcgga 



cagcaccccc 
ggagccactc 
acctagccag 
cacacagtgc 
ccaggatgtg 
gtcgctgcac 
cacactgacc 
gaagagcgct 
tgtcctgcct 
ccaccccgag 
gcccgaggtc 
gctgacgtgc 
gtcacaggag 
gggcaccacc 
99999acacc 
gaccatcgac 
ggtggacggc 



caagatggga 
agtgtgacct 
gatgcctccg 
ccagacggca 
actgtgccct 
cgaccggccc 
ggcctgagag 
gttcaaggac 
ggctgtgccc 
ttgaagaccc 
cacctgctgc 
ctggcacgtg 
ctgccccgcg 
accttcgctg 
ttctcctgca 
cgcttggcgg 
acctgctact 



-1 
60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
500 
960 
1020 
1022 



GenBank 

J00221 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



Human ig germline 
HUMIGCD7 2516 bp DNA pri U-APR-2001 

Human Ig germline H-chairi G-E-A region B: alpha-2 A2m(l) allele 
constant region, 3* end. 
J00221 

J00221.1 GI:184756 
human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 {bases 1 to 2516) 
Ellison, J. and Hood, L. 

Linkage and sequence homology of two human immunoglobulin gamma 
heavy chain constant region genes 

Proc. Natl. Acad. Sex. U.S.A. 79 (S) , 1984-1988 (1982) 

82197621 

.6804949 

2 (bases 737 to i016) 
Flanagan, J. G. and Rabbi ttB,t.H. 

Arrangement of human immunoglobulin heavy chain constant region 

genes- implies evolutionary duplication of a segment containing 

gamma , epsilon and alpha genes 

Nature 300 (5894) , 709-713 (1982) 

83088998 

6817141 

3 (bases 49 tb 229; 425 to 514) 

Hisa;Uma,H., Nishida,Y., Na*ai,s., Takahashi,N. , Ueda,S and 
Hon j o , T . 

Structure of the human immunoglobulin C epsilon 2 gene, a truncated 

pseudogene: implications for its evolutionary origin 

Proc. Natl. Acad. Sci. U.S.A. 80 (10), 2995-2999 (1983) 

63221529 

6407005 

4 (bases 1 to 2516) 

Flanagan, J. G., Lefranc/M.P. and Rabbit ts,T.H. ( 
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FIGURE 7 



TITIiB Mechanisms of divergence and convergence of the human 

immunoglobulin alpha 1 and alpha 2 constant region gene sequences 
JOURNAL Cell 36 (3) , 681-688 (1984) 
MEDLINE 84130179 
PUBMBD 6421489 

COMMENT [33 also reports the complete alpha- 1 gene and part of the A2m(2) 

alpha-2 allele (bases 737-2516; see Sites table). Comparison of the 
three sequences suggests that the A2m(l) alpha-2 allele might be a 
hybrid of the alpha-1 gene and A2m(2) alpha-2 allele. The hinge 
region in the alpha genes occurs at beginning of the CH2 domain. 
The alpha-1 hinge region is 13 amino acids longer than that in 
alpha-2. Both hinge regions consist of approximate tandem repeats 
of a 15 bp sequence. The first repeat occurs 5' to the mRWA splice 
site and is non- coding. The authors [3] suggest that this 
repetitive structure provides a possible mechanism for the large 
number of variations observed in hinge regions. There is a coupled 
30 bp insertion, . 9 bp deletion in alpha-2 relative' to alpha-1 
(starting at base 97) . 

tl3 also reports sequences for the epsilon-1 and epsilon-2 
(pseudogene) C-region genes. The authors [1] determined the 
physical linkage between epsilon-1 and alpha-2 and that between 
epsxlon-2 and alpha-1. [2] also reports the alpha-1 CH2 domain and 
cpoilon-2 . 

This entry is part of a raultigene region (region B) , which includes 
the gamma-2, gamma-4, epsilon-L and alpha-2 genes. See segment 1 
for more comments. 
Complete source information: 

Human genomic DNA, cosmid IglO [1] , [3] ; placenta DNA [2] clone 
H-Ig-alpha-25; genomic DNA from TOU II-5 library clone 
larabda-TOU-alpha2 (forA2m(2] allele) [3]. 
Location/Qualifiers 
1. .2516 

/organism= "Homo sapiens" 
/db_xref = " taxon : 96 0 6 w 
/map=»14q32. 33" 
/germline 
<1..1621 
/gene«"IgH" 
/note« p IGHA2 n 
<1..163 
/gehe="lgH" 

/note= M alpha-2 intron J-C" 
join(<164. .469,684. .1004,1227. .1621) 
/gene^IgH" 

/note= n contains constant region" 
/codon_start»3 

/product^ "immunoglobulin alpha-2 heavy chain" 
/protein_id= w AAB593 96 . 1" 
/db_xref ="GI : 184761 ■ 

/ 1 ran Bl a t i on= " S PTS PKVFPLSLDSTPQDCSNVVVACLVQGFFPQEPIiSVTWSESG 
QIWTARWF?PSQDA3GDLYTTSSQLTLPATQCPDGKSVTCHVKHYrNPSQDVTVFCPV 
PPPPPCCHPRI^IiHRPALEDLLLGSEA^TCn^TGLR 

PEPJJLCGCYSVSSvLPGCAQPWNHGETFTCTAAHPELKTPLTAHITKSGlTrF^PEVHL 
LPPPSEELALNELVTLTCTj^GFSPKDVLVRMl^GSQELPREKYLrWASRQEPSQGTT 
TFAVTSI LRVAAED W KKGDT F S CMVGHEALPLAFTQKITDIU^GKPTHVNVSW 
DGTOY" 

exon 164 . .469 

/gene= w IgH" 
/note= "000-119-333" 
intron 470.. 683 

/gene=«IgH» 

/note- "alpha-2 intron A" 
ekon 684.. 1004 



FEATURES 

source 



gene 



intron 



CDS 
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. /gene« " IgH" 
intron 1005 . . 1226 

/gene= ,, rgH» 

/note="alpha-2 Intron B" 

exon 1227.. 1621 

/gene=r"IgH» 
variation 1434 

/gene=»IgH" 

/note="t in A2mU>; a in A2m(2) 
variation 1441 

/genes n IgH" 

/note="g in A2m(l) 
variation 1465 

/gene="IgH n 

/note="c in A2m(l) ; t ,in A2m(2) ■ 
variation 1486 

/gene» n IgH" 

/note=» w c in A2m(l) 
variation 1553. 

/gene= n IgH» 

/note« w t in A2m(l) 
variation 1573 . . 1574 

/gene- "IgH" 

/note= w tg in A2m(l) ; ca in A2m(2) 
variation 1602 . . 1605 

/gene="IgH u 
/note= tt tggac in A2m(l 
variation 2060 

/note="c in A2m(l) ; t 
variation ,2384 

/note="a in A2m(l) ; c in A2m(2) 
variation 2390 

/notes" c in A2m(l) ; 
861 c 754 g 



a in A2m(2) 1 



g in A2m(2) ° 



a in A2m (2) " 



; eggat in A2rci(2) " 
in A2m(2) B 



g 



488 a 



in A2m(2) " 
413 t 



BASE COUNT 
ORIGIN 

1 ggtccaaccg caggcccatg gtgeaggage tgtgtaacct atggggctgt caccaggcct 
61 ctctgtgctg ggttcctcca gtgtagagga gaggcaggta cagcctgtcc tcctggggac 
121 atggcatgag ggccgcgtcc tcacagcgca ttctgtgttc cagcatcccc gaccagcccc 
181 aaggtcttcc cgctgagcct cgacagcacc ccccaagatg ggaacgtggt cgtcgcatgc 
241 ctggtccagg gcttcttccc ccaggagcca ctcagtgtga ectggagega aagcggacag 
301 aacgtgaccg ccagaaactt cccacctagc caggatgect ccggggacct gtacaccacg 
361 agcagccagc tgaccctgcc ggccacacag tgcccagacg gcaagcccgt gacatgccac 
421 gtgaagcact acacgaatcc cagecaggat gtgactgtgc cctgcccagg tcagagggca 
481 ggctggggag tggggcgggg ccaccccgtc ctgccctgac actgcgcctg cacccgtgtt 
541 ccccacaggg agccgcccct tcactcacac cagagtggac cccgggccga gccccaggag 
601 gtggtggtgg acaggecagg aggggegagg egggggcacg gggaagggcg ttctgaccag 
661 ctcaggccat ctctccactc cagttccccc acctccccca tgctgccacc cccgactgtc 
721 gctgcaccga ccggccctcg aggacctget cttaggttca gaagegaace tcacgtgcac 
781 actgaccggc ctgagagatg cctctggtgc caccttcacc tggacgccct caagtgggaa 
841 gagcgctgtt caaggaccac ctgagcgtga cctctgtggc tgctacagcg tgtccagtgt 
901 cctgcctggc tgtgcccagc catggaacca tggggagacc ttcaectgea ctgctgccca 
961 ccccgagttg aagaccccac taaccgccaa catcacaaaa tecggtgggt ccagaccctg 
1021 ctcggggccc tgctcagtgc tctggtttgc aaagcatatt cccggcctgc ctcctccctc 
1081 ccaatcctgg gctccagtgc teatgecaag tacacaggga aactgaggca ggctgagggg 
1141 ccaggacaca geccagggtg cccaccagag cagaggggefc ctctcatccc ctgcccagcc 
1201 ccctgacctg gctctctacc ctccaggaaa cacattccgg cccgaggtcc acctgctgcc 
1261 gccgccgtcg gaggagctgg ccctgaacga gctggtgacg ctgacgtgcc tggcacgtgg 
1321 cttcagcccc aaggatgtgc tggttcgctg getgeagggg tcacaggagc tgccccgcga 
1381 gaagtacctg acttgggcat cccggcagga gcccagccag ggcaccacca ccttcgctgt 
1441 gaccagcata ctgcgcgtgg cagecgagga ctggaagaag ggggacacct tctcctgcat 
1501 ggtgggccac gaggccctgc cgctggcctt cacacagaag accatcgacc gcttggcggg 
1S61 taaacccacc catgtcaatg tgtctgttgt catggeggag gtggacggca cctgctactg 
1621 agccgcccgc ctgtccccac ccctgaataa actccatgct cccccaagca gccccacgct 
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1681 bccabccggc gcct9tcfcgt ccabccbcag ggbctcagca ctbgggaaag ggccagggca 
Utl ^acaggga agaatacccc ctgccctgag ccbcgggggg cccctggcac ccecatgaga 
1801 ctttccaccc tggtgtgagt gtgagttgtg agbgtgagag tgbgbggtgc aggaggcctc 
1861 gctggtgtga gatcttaggt ctgccaaggc aggcacagcc caggatgggt tctgagagac 
1921 gcacatgccc cggacagttc tgagtgagca gtggcatggc cgtttgtccc tgagagagcc 
198?. gcctctggct gtagctggga gggaataggg agggtaaaag gagcaggcta gccaagaaag 
2041 gcgcaggtag tggcaggagc ggcgagggag tgaggggctg gactccaggg ccccactggg 
2101 aggacaagct ccaggagggc cccaccaccc tagtgggtgg gcctcaggac gtcccactga 
2161 cgcatgcagg aaggggcacc tccccttaac cacactgctc tgtacggggc acgtgggcac 
2221 acatgcacac tcacactcac atatacgcct gagccctgca ggagtggaac gttcacagcc 
2281 cagacccagt tccagaaaag ccaggggagt cccctcccaa gcccccaagc tcagcctgct 
2341 cccccaggcc cctctggctt ccctgtgttt ccactgtgca cagatcaggc accaactcca 
2401 cagacccctc ccaggcagcc cctgetccct gcctggccaa gtctcccatc ccttcctaag 
2461 cccaactagg acccaaagca tagacaggga ggggccgcgt ggggbggcat cagaag 

DL HUMAN IG GAMMA-1 CHAIN C REGION - HOMO SAPIENS (HUMAN) 

AMINO ACID SEQUENCE 

> 6 p[P01857|GCl_HUMAN IG GAMMA-1 CHAIN C REGION - Homo sapiens (Human). 

X ° 20 30 40 50 60 

1 I I | I I 

ASTKGPSVFP LAPSSKSTSG GTAALGCItVK DYFPEPVTVS WNSGALTSGV HTFPAVLQSS 

70 80 90 100 110 120 

t I | I I I 

GLYSLSSWT VPSSSLGTQT YICNVNHKPS NTKVDKKVEP KSCDKTHTCP PCPAPELLGG 

13 ° 140 150 160 170 180 

PSVFLFPPKP KDTLMISRTP EVTCVWDVS HEDPEVKFNW YVDGVEVHNA KTKPREEQYN • 

190 - 200 210 220 230 240 

■ I I I | I | 

STYRWSVLT VLHQDWLNGK EYKCKVSNKA LPAPIEKTIS KAKGQPREPQ VYTLPPSRDE 

250 2 60 270 280 290 300 

LTKNQVSLTC LVKGFYPSDI AVEWESNGQP ENNYKITPPV LDSDGSFFli SKLTVDKSRW 

310 320 330 

I I I 

QQGNVFSCSV MHEALHNHYT QKSLSLSPGK 
CODING SEQUENCE 



ccbocaccaa gggcccatcg gtcttccccc tggcaccctc ctccaagagc accbctgggg 60 

gcacagcggc cctgggctgc ctggtcaagg actacttccc cgaaccggtg acggtgtcgt 120 

ggaacbcagg cgcccbgacc agcggogtgc acaccttccc ggctgtccta cagtcctcag 180 

gactctactc cctcagcagc gtggbgaccg tgccctccag cagcttgggc acccagacct 240 

acatctgcaa cgtgaatcac aagcccagca acaccaaggt ggacaagaaa gttgagccca 300 

aatctbgtga caaaactcac acatgcccac cgtgcccagc acctgaactc ctggggggac 360 

cgtcagtctt cctcttcccc ccaaaaccca aggacaccct catgatctcc cggacccctg 420 

aggtcacatg cgtggtggtg gacgtgagcc acgaagaccc tgaggtcaag ttcaactggt 480 

acgbggacgg cgbggaggbg cataabgcca agacaaagcc gcgggaggag cagtacaaca 540 

gcacgtaccg ggtggbcagc gbccbcaccg bccbgcacca ggacbggctg aabggcaagg 600 

agbacaagbg caaggbcbcc aacaaagccc bcccagcccc catcgagaaa accatctcca 660 

aagccaaagg gcagccccga gaaccacagg bgbacacccb gcccccabcc cgggatgagc 720 
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H 



tgaccaagaa ccaggtcagc ctgacctgcc tggtcaaagg cttctatccc agcgacatcg 780 

ccgtggagtg ggagagcaat gggcagccgg agaacaacta caagaccacg cctcccgtgc 840 

tggactccga cggctccttc ttcctctaca gcaagctcac cgtggacaag agcaggtggc 900 

agcaggggaa cgtcttctca tgctccgtga tgcacgaggc tcbgcacaac cactacacgc 960 

agaagagcct ctccctgtct ccgggtaaat ga 392 



GenBank 

J00228. 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



HOMIGCC4 2009 bp DNA PRI 02-DEC-1998 

Homo sapiens immunoglobulin gamma-1 heavy chain constant region 
(IGHG1) gene, partial cds. 



J00228 
J00228 



1 GIj184739 



Craniata; Vertebrafca; Euteleostomi; 
Catarrhini ; Hominidae ; Homo . 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
COMMENT 



FEATURES 

source 



human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Primates; 
1 (bases 1 to 2009) 

Takahashi,N. , Ueda,S., Obata,M. , Nikaido,T., Nakai,S. andHonjo,T. 
Structure of human immunoglobulin gamma genes: Implications for 
evolution of a gene family 
Cell 29, 671-679 (1982) 
83001943 

This sequence is part of a multigene region containing the 
immunoglobulin heavy chain gamma-3, garama-1, pseudo-epsilon, and 
alpha-1 genes. 

Location/Qualifiers 
1..2009 

/organism= "Homo sapiens" 
/db_xref = " taxon .•9606" 
/chromosomes ? 14 " 
/raap= , '14q32.33" 

/clone= rt cosmid Igl3; Ig-gamma3-122 , ' 
/tissue_type= "placenta; liver" 
/germline 
<1. .>1803 
/gene= tt IGHGl n 
<1. .209 
/gene="iGHGl n 

join i<2 10. .503,892. .935,1055.-1384,1481. .1803) 
/gene= n IGHGl n 
/codon_start«3 

/product a" immunoglobulin gamma -1 heavy chain constant 
region* 

/protein_id= «AAC82527.1 n 
/db_xref="GI: 184747" 

/ 1 r ans 1 a t ion= " STKGPSVFPLAPSSKSTSGGTAALGCLVKDYFPEPVTVS NNSGA 

LTSGVHTFPAVIOSSGLYSLSSV\n^PSSSLGTQTYiaJvimKPSNTK^^ 

DKTHTCPPCPAPELLGGPSVFLFPPKPKDTIiMISRTPEVTCVV^ 

VDG\^ VHNAKTKPR E EQ YNS T YR W9 VLTVLHQD WLNG KEYKCKVSMKAL P A PI EKT I 

SKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAV^WESNGOPENNYKTT 

P PVLDSDGS F FL YS KLTVTJKS RWQQGNVFSCS VMHE ALHNHYTQKSLSLS PGK" 

misc_dif ference 563 

/gene= 0 IGHGl B 
/replace-" " 

misc_dif ference 593 

/gene=«IGHGl» 
/replace-" 0 

misc — difference 614 

/gene«"IGHGl M 
/replace^ "" 



gene 



intron 



CDS 
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mi sc__dif ference 633 

/gene«"IGHGl " 

/replace^"" 
misc_differehce 643 

/gene^'IGHGl" 

/replace=" n 
misc_difference 654 

/gene^IGHGl" 

/replace»" n 
misc_dif ference 684 

/gene- " IGHGI " 

/replaces"" 
misc__dif ference 692 

/gene= "IGHGI » 

/replace=»" 
misc_dif ference 765 . . 766 

/gene =" IGHGI " 

/replace= M " 
misc_dif ference 1475 

/gene="lGHGi» 

/replace=" 11 
misc_dif ference 1578 

/gene=» IGHGI » 

/replace-"" 

BASE COUNT 418 a 698 c 566 g 327 t 

ORIGIN 

1 agctttctgg ggcaggccag gcctgacctt ggctttgggg cagggagggg gctaaggtga 
61 ggcaggtggc gccagcaggt gcacacccaa tgcccatgag cccagacact ggacgctgaa 
121 cctcgcggac agttaagaac ccaggggcct ctgcgcctgg gcccagctct gtcccacacc 
181 gcggtcacat ggcaccacct ctcttgcagc ctccaccaag ggcccatcgg tcttccccct 
241 ggcaccctcc tccaagagca cctctggggg cacagcggcc ctgggctgcc tggtcaagga 
301 ctacttcccc gaaccggfcga cggtgtcgtg gaactcaggc gccctgacca gcggcgtgca 
361 caccttcccg gctgtcctac agtcctcagg actctactcc ctcagcagcg tggtgaccgt 
421 gccctccagc agcttgggca cccagaccta catctgcaac gtgaatcaca agcccagcaa 
481 caccaaggtg gacaagaaag ttggtgagag gccagcacag ggagggaggg tgtctgctgg 
541 aagcaggctc agcgctcctg cctggacgca tcccggctat gcagccccag tccagggcag 
601 caaggcaggc cccgtctgcc tcttcacccg gagcctctgc ccgccccact catgctcagg 
663. gagagggtct tctggctttt tcccaggctc tgggcaggca caggctaggt gcccctaacc 
721 caggccctgc acacaaaggg gcaggtgctg ggctcagacc tgccaagagc catatccggg 
781 aggaccctgc ccctgaccta agcccacccc aaaggccaaa ctctccactc cctcagetcg 
B41 gacaccttct ctcctcccag attccagtaa ctcccaatct tctctctgca gagcccaaat 
901 cttgtgacaa aactcacaca tgcccaccgt gcccaggtaa gccagcccag gcctcgccct 
961 ccagctcaag gcgggacagg tgccctagag tagcetgcat ccagggacag gccccagccg 
1021 ggtgctgaca cgtccacctc catctcttcc tcagcacctg aactcctggg gggaccgtca 
1081 gtcttcctct tccccccaaa acccaaggac accctcatga tctcccggac ccctgaggtc 
^Itn acat 9 c 9 t 99 tggtggacgt gagccacgaa gaccctgagg tcaagttcaa ctggtacgtg 
1201 gacggcgtgg aggtgcataa tgccaagaca aagccgcggg aggagcagta caacagcacg 
1261 taccgggtgg tcagcgtcct caccgtcctg caccaggact ggctgaatgg caaggagtac 
1321 aagtgcaagg tctccaacaa agccctccca gcccccatcg agaaaaccat ctccaaagcc 
1381 aaaggtggga cccgtggggt gcgagggcca catggacaga ggccggctcg gcccaccctc 
1441 tcccctgaga gtgaccgctg taccaacctc tgtcctacag ggcagccccg agaaccacag 
1501 gtgtacaccc tgcccccatc ccgggatgag ctgaccaaga accaggtcag cctgacctgc 
1551 ctggtcaaag gcttctatcc cagcgacatc gccgtggagt gggagagcaa tgggcagccg 
1521 gagaacaact acaagaccac gcctcccgtg ctggactccg acggctcctt cttcctctac 
1SB1 agcaagctca ccgtggacaa gagcaggtgg cagcagggga acgtcttctc atgctccgtg 
1741 atgcatgagg ctctgcacaa ccactacacg cagaagagcc tctccctgtc tccgggtaaa 
1801 tgagtgcgac ggccggcaag ccccgctccc cgggctctcg cggtcgcacg aggatgcttg 
1B51 gcacgtaccc cctgtacata cttcccgggc gcccagcatg gaaataaacc acccagcgct 
1921 gccctgggcc cctgcgagac tgtgatggtt ctttccacgg gtcaggccga gtctgaggcc 
1931 tgagtggcat gagggaggca gagcgggtc 
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IV. HUMAN IG GAMMA-2 CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

>sp|P01859|GC2_HUMAN IG GAMMA -2 CHAIN C REGION - Homo capieno (Human). 
10 20 30 40 50 60 

■I ! I I I | 

ASTKGPSVPP LAPCSRSTSE • STAALGCLVK DYFPEPVTVS WNSGALTSGV HTFPAVLQSS 

80 90 100 HO 120 

I I I | | 

GLYSLSSWT VPSSNPGTQT YTCNVDHKPS NTKVDKTVER KCCVECPPCP APPVAGPSVF 

130 140 150 160 170 180 

I I I I I I 

LFPPKPKDTL MISRTPEVTC VWDVSHEDP EVQPNWYVDG VEVHNAKTKP REEQFNSTFR 

!90 200 210 220 230 240 

I I I | | | 

WSVLTWHQ DWLNGKEYKC KVSNKGLPAP IEKTISKTKG QPREPQVYTL PPSREEMTKN 

250 260 270 280 * 290 300 

I I I. | | | 

QVSI/TCLVKG FYPSDIAVEW ESNGQPENNY KTTPPMLDSD GSFFLYSKLT VDKSRWQQGN 

310 320 326 



VFSCSVMHEA LHNHYTQKSL SLSPGK 

CODING SEQUENCE 

gcctccacca agggcccatc ggtcttcccc ctggcgccct 
agcacagccg ccctgggctg cctggtcaag gac tact tec 
tggaactcag gcgctctgac cageggegtg cacaccttcc 
ggactctact ccctcagcag cgtggtgacc gtgccctcca 
tacacctgca aegtagatea caagcccagc aacaccaagg 
aaatgttgtg tcgagtgccc accgtgccca gcaccacctg 
ctcttccccc caaaacccaa ggacaccctc atgatctcoc 
gtggtggtgg aegtgageca cgaagacccc gaggtccagt 
9fc99 a 9gtgc ataatgecaa gaeaaageca egggaggage 
gtggtcagcg tcctcaccgt tgtgcaccag gactggctga 
aaggtctcca acaaaggect cccagccccc atcgagaaaa 
cagccccgag aaccacaggt gtacaccctg cccccafcccc 
caggtcagcc tgacctgcct ggtcaaaggc ttctacccca 
gagagcaatg ggcagccgga gaacaactac aagaccacac 
ggctccttct tcctctacag caagctcacc gtggacaaga 
gtcttctcat getcegtgat gcatgaggct ctgcacaacc 
tccctgtctc egggtaaa 



gctccaggag 
ccgaaccggt 
cagctgtcct 
geaacttegg 
tggacaagac 
tggcaggacc 
ggacccctga 
tcaactggta 
agfctcaacag 
aeggcaagga 
ccatctccaa 
Sggaggagat 
gcgacatcgc 
ctcccatgct 
gcaggtggca 
actacacgca 



cacctccgag 
gacggtgtcg 
acagtcctca 
cacccagacc 
agttgagcgc 
gtcagtcttc 
ggtcacgtgc 
cgtggacggc 
cacgttccgt 
gtacaagtgc 
aaccaaaggg 
gaccaagaac 
cgtggagtgg 
ggactccgac 
gcaggggaac 
gaagagcetc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
978 



GenBank 

J00230. Human Ig germline ... 

HUMIGCD1 2009 bp DNA PRI ll-APR-2001 

Human I g germline Hr-chain G-E-A region B: gamma- 2 constant region, 
3' end, 
J00230 V00554 
J00230.1 GI: 184750 



LOCUS 
DEFINITION 



ACCESSION 
VERSION • 
KEYWORDS 
SOURCE 
ORGANISM 



human. 

Homo sapiens 

BuJcaryota; Metazoa; Cnordata; Craniata,- Vertebrata; Euteleostomi; 
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REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 



MEDLINE 
REFERENCE 
AUTHORS 
TITLE 

MEDLINE 
COMMENT 



FEATURES 



CDS 



Mammalia; Eutheria;* Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 2009) 
Ellison, J. and Hood,li. 

Linkage and sequence homology of two human immunoglobulin gamma 
heavy chain constant region genes 

Proc. Natl. Acad. Sci. U.S.A. 79 (6), 1984-1988 (1982) 

82197621 

6B04948 

2 (bases 896 to 1256; 1749 to 1937) 
Krawinkel,U. and Rabbi tts,T.H. 

Comparison of the hinge-coding segments in human immunoglobulin 
gamma heavy chain genes and the linkage of the gamma 2 and gamma 4 
subclass genes JOURNAL EMBO J. 1 (4), 403-407 (1982) 
84235992 PUBMED 6329676 

3 (bases 475 to 1071; 1179 to 1330; 1461 to 1524) 
Takahashi,N. , Ueda,S., Obata,M., Nikaido,T., Nakoi,S. and Honjo,T. 
Structure of human immunoglobulin gamma genes: implications for 
evolution of a gene family JOURNAL Cell 29 (2), 671-679 (1982) 
83001943 PUBMED 6811139 

On Mar 2, 2000 this sequence version replaced gi: 32759. 
f2] also reports sequences for gamma -3, gamma-4, and a gamma 
pseudogene. Most of this sequence is 95% homologous with gamma-4. 
The hinge exono are only 70% homologous. The authors estimate that 
gamma-2 and gamma-4 diverged 6.5 million years ago. The authors in 
[1] speculate that intron-mediated domain transfer played an 
important role in the evolution of human gamma genes. They also 
report the hinge regions of gamma- 1, gamma- 3, gamma- 4, and a 
pseudo- gamma gene. [1] estimates the divergence of the human gamma 
genes to be between 7.7 and 4.4 million years ago. This entry is 
part of a multigene region containing the gamma-2 / gamma-4, 
epsilon-1, and alpha-2 genes. The relative locations of the four 
genes were determined by Flanagan and Rabbi tts (Nature 300, 709-713 
(1982)) . They refer to this gene group as region B. The region A 
genes are gamma-3, gamma-1, pseudo-epsilon, alpha-1. Flanagan and 
Rabbits also determined the general locations of the two regions. 
They place region A between the JH/mu/delta region and region B. 
Complete source information: 

Human fetal liver DNA, library of T. Maniatis [31 and Lawn et al 
[2] , El] ; clones p-gamma-2RPA3 [2) , 5A [3] , and Ig-gamma-2-15 fl] . 
Location/Qualifiers source 1..2009 * 

/organism="Homo sapiens" 



/map= M l4q32.33" 

<l. .215 
<1..2009 
/note="IGHG2" 
/gene="IgH n 
C_region 2 16.. 50 8 



intron 
conflict 



/c3b_xre£= " taxon : 96 06 ■ 
/germline intron 
/gene="igH" gene 
/gene= " igH" 
216. .509 

/note=»G00-119-33B B 
/gene«"lgH" 

/note= tt immunoglobulin heavy chain constant region CHI" 
join(<216. .509,902. .93 7,1056. .1382,1430. .1802) 
/gene^'igH" /codon_start=3 
/products "immunoglobulin gamma-2 heavy chain" 
/protein_id=°AAB59393 - 1" 
/db_xref="GI: 184758" 

/ transl at ion* » STKGPSVFPLAPCSRSTSESTAALGCLVKDYFPEPVTVSWNSGA 
LTSGVHTFPAVLQS SGLYSL3SWTVPS SNFGTQTYTQJVDHKPSNTKVDKTVERKCC 
VECPPCPAPPVAGPSVTLFPPKPKDTLMISRTPEVTCVVTOVSHEDP 
EVHNAKTKPREEQFNSTFRWSVLTVVHQDV7LNGKEYIGCKV 

GQPREPQVYTL PPSREEMTKWQVSLTCL VKGFY PSDI AVEWE SNGQPENNYXTTPPML 

DSDGSFFLYSKLTVDKSRWQQGNvTSCSVWHEALHNHYTQKSLSLSPGK l, 

510. .901 /gene=."IgH" 

537 /gene« 0 igH° 

/citation- [3] /replace^ 0 " 
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conflict 
ccsnflict 
conflict 
conflict 
conflict 
Cjregion 
excn 



550. .551 
/citation» [3] 
570 

/citation^ [3] 
777.. 778 
/citation= [3] 

791 

/citation=[3) 
864 

/citation- [3] 
901. .935 



/genes ,, IgH tt 

/replaces"" 
</gene="IgH" 

/replaces"" 
/gene=»lgH* 

/replaces"" 
/gene= "IgH" 

/replaces" 
/gehe» n lgH M 

/replaces"" 
/gene="IgH B 
/note= "immunoglobulin heavy chain hinge 9 
902.. 937 /genes^gH" 
/note=»G00-119-338» intron 938. .1055 

/gene^IcR" Cjregion 1055.. 1381 

/genes n IgH B 

/notes "immunoglobulin heavy chain constant region CH2" 
1056.. 1382 /genes »igH" 

/note=»G00-119-338» intron 13 83 . .1479 

/gene="lgH n C_region 1479.. 1799 

/genes »igH w 

/note= "imrrrunoglobul in heavy chain constant region CH3*» 
1480.. 1802 /gene=»IgH» 
/note^GOO-l^-SSa" conflict 1493 



ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 



/geneoWigH" 
/replaces" 0 
/gene« "IgH" 
/replace^ ,r B 
/genes »igH" 
/replace* " n 
/gene^IgH" 
/replaces" n 
/gene^igH" 
/replace-" B 
/gene="IgH*« 
/replaces » ■ 
/genes "IgH " 
/gene= "IgH" 
/replaces "*» 
/genes »igH p 
/ replace^ " "BASE COUHT 



conflict 

conflict 

conflict 

conflict 

conflict 

polyA_signal 
conflict 

conflict 



/citation** [3] 
1802 . .1806 
/citations [2] 
1814. .1815 
/citations [2] 
1825 

/citation- [2] 
1844. .1853 
/citation^ [2] 
1890 

/citations [2] 
1903. .1908 
1909. .1918 
/citations [2] 
1929. .1932 
/citations [2] 
410 a 700 c 558 g 



331 t 



agctttctgg 
ggcaggtggc 
ctggaccctc 
cacaccgcgg 
ccccctggcg 
caaggactac 
cgtgcacacc 
gaccgtgccc 
cagcaacacc 
tgctggaagc 
agggcagcaa 
tgctcaggga 
gcccctaccc 
catatccggg 
cctcagctcg 
gagcgcaaat 
tccagctcaa 
gggtgbtgac 
cttcctcttc 
gtgcgtggtg 
cggcgtggag 



ggcgagccgg 
gccagccagg 
gtggatagac 
tcacatggca 
ccctgctcca 
ttccccgaac 
ttcccagctg 
tccagcaact 
aaggtggaca 
caggctcagc 
ggcaggcccc 
gagggtcttc 
caggcccttc 
aggaccctgc 
gacaccttct 
gttgtgtcga 
ggcgggacag 
acgtccacct 
cccccaaaac 
gtggacgtga 
gtgcataatg 



gcctgactct 
tgcacaccca 
aagaaccgag 
ccacctctct 
ggagcacctc 
cggtgacggt 
tcctacagtc 
tcggcaccca 
agacagttgg 
cctcctgccc 
atctgtctcc 
tggctttttc 
acacacaggg 
ccctgaccta 
ctcctcccag 
gtgcccaccg 
gtgccctaga 
ccatctcttc 
ccaaggacac 
gccacgaaga 
ccaagacaaa 



ggctttgggg 
atgcccgtga 
gggcctctgc 
tgcagcctcc 
cgagagcaca 
gtcgtggaac 
ctcaggactc 
gacctacacc 
tgagaggcca 
ggacgcaccc 
tcacccggag 
caccaggctc 
gcaggtgctt 
agccgacccc 
atccgagtaa 
tgcccaggta 
gtagcctgca 
ctcagcacca 
cctcatgatc 
ccccgaggtc 
gccacgggag 



cagggagtgg 

gcccagacac 

gcctgggccc 

accaagggcc 

gccgccctgg 

tcaggcgctc 

tactccctca 

tgcaacgtag 

gctcagggag 

cggctgtgca 

gcctctgccc 

caggcaggca 

ggctcagacc 

aaaggccaaa 

ctcccaatct 

agccagccca 

tccagggaca 

cctgtggcag 

tcccggaccc 

cagttcaact 

gagcagttca 



gctaaggtga 

tggaccctgc 

agctctgtcc 

catcggtctt 

gctgcctggt 

tgaccagcgg 

gcagcgtggt 

atcacaagcc 

ggagggtgtc 

gccccagccc 

gccccactca 

caggctgggt 

tgccaaaagc 

ctgtccactc 

tctctctgca 

ggcctcgccc 

ggccccagct 

gaccgtcagt 

ctgaggtcac 

ggtacgtgga 

acagcacgtt 
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1261 ccgtgtggtc agcgtcctca ccgttgtgca ccaggactgg ctgaacggca aggagtacaa 
1321 gtgcaaggtc tccaacaaag gcctcccagc ccccatcgag aaaaccatct ccaaaaccaa 
1381 aggtgggacc cgcggggtat gagggccaca tggacagagg ccggctcggc ccaccctctg 
1441 ccctgggagt gaccgctgtg ccaacctctg tccctacagg gcagccccga gaaccacagg 
1501 tgtacaccct gcccccatcc cgggaggaga tgaccaagaa ccaggtcagc ctgacctgcc 
1561 tggtcaaagg cttctacccc agcgacatcg ccgtggagtg ggagagcaat gggcagccgg 
1621 agaacaacta caagaccaca cctcccatgc tggactccga cggctccttc ttcctctaca 
1681 gcaagctcac cgtggacaag agcaggtggc agcaggggaa cgtcttctca tgctccgtga 
1741 tgcatgaggc tctgcacaac cactacacgc agaagagcct ctccctgtct ccgggtaaat 
1801 gagtgccacg gccggcaagc ccccgctccc caggctctcg gggtcgcgtg aggatgcttg 
1861 gcacgtaccc cgtgtacata cttcccaggc acccagcatg gaaataaagc acccagcgct 
1921 gccctgggcc cctgcgagac tgtgatggtt ctttccgtgg gbcaggccga gtctgaggcc 
1981 tgagtggcat gagggaggca gagtgggtc 

V. HUMAN IG GAMMA-3 CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

CAA272 68 C gamma 3 [Homo sapiens] 

10 20 30 40 50 60 

I I I I 1 I 

ASTKGPSVFP LAPCSRSTSG GTAALGCIiVK DYFPEPVTVS WNSGAI/TSGV HTFPAVLQSS 

70 80 90 100 110 120 

I I I i I I 

GLYSLSSWT VPSSSLGTQT YTCNVNHKPS NTKVDKRVEL KTPLGDTTHT CPRCPEPKSC 

130 140 150 160 170 180 

I I I I I I 

DTPPPCPRCP EPKSCDTPPP CPRCPBPKSC DTPPPCPRCP APELLGGPSV FLFPPKPKDT 

190 200 210 220 230 240 

U i I I I I 

LMISRTPETVT CVWDVSHED PEVQFKWYVD GVEVHNAKTK PREEQYNSTF RWSVLTVLH 

250 260 270 280 290 300 

I I I I I I 

QDWLNGKEYK CKVSNKALPA PIEKTISKTK GQPREFQVYT LPPSREEMTK NQVSLTCLVK 

310 320 330 340 350 360 

I I I I I I 

GFYPSDIAVE WBSSGQPENN YNTTPPMLDS DGSFFLYSKL TVDKSRWQQG NIFSCSVMHE 

370 377 
I 

ALHNRFTQKS LSLSPGK 
CODING SEQUENCE 

GCTTCCACCAAGGGCCCATCGGTCTTCCCCCTGGCGCCCTGCT 

CTGCCTGGTCAAGGACTACTTCCCCGAACCGGTGACGGTGTCGTGGAACTCAGGCGCCCTGACCAGCGGCGTGCACA 

CCTTCCCGGCTGTCCTACAGTCCTCAGGACTCT^ 

ACCCAGACCTACACCTGCAACGTGAATCACAAGCCCAGCA^ 

ACTTGGTGACACAACTCACACATGCCCACGGTGCCGAGAGCCCAAATC^ 

GCCC&GAGCCCAAATCTTX3TGAC^C^CCTCCCCCATGCCCAC^ 

CCGTGCCCAAGGTGCCCAGCACCTCAACXCCTGGGAGGACCGTCAGTCTTCCTCTTCCC 
CCOTATGATTTCCCGGACCCCTGAGGTCACGTGCGTGGTGGTGGAC^ 

AGTGGTACGTGGACGGCGTGGAGGTGC^TAATGCCAAGACAAAGCCGCGGGAGGAGCAGTAC^ACAGCACGTTCCGT 
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GXGGTCAGCGTCCTCMCGTC^ 

CCTCCCAT^CCCCATCGAGAAAACCATCTCC^^ 

CATCCCGGGAGGAGATGACCAAGAACCAGGTCAGCCTCTCCTC 

GTGGAGTGGGAGAGC!AGC^GGCAGCCGGJy3AACAACTACA 

CTTCC7CTACAGCAAGCTCACC3TGGACAAGAGCAGGIGGCA 

AGGCTCTGCACAACCGCTTCACSCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATGA 

GenBank 

X03604 Human C gamma 3 gene for IgG G3tn(b) heavy chain C-region from EZZ 
(individual II-4 of TOU) PubMed, Protein, Related Sequences, Taxonomy, OMIM, 
LinJcOut 



LOCUS 

DEFINITION 

ACCESSION" 

VERSION 

KEYWORDS 

SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 

MEDLINE 
REFERENCE 

AUTHORS 
Honjo, T. 

TITLE 

JOURNAL 
MEDLINE 
FEATURES 

source 



conflict 



conflict 



conflict 



conflict 



CDS 



HSIGGC3 2590 bp DNA PR I 24-NOV-1993 

Human C garrnia 3 gene for IgG G3m{b) heavy chain C-region from EZZ 
(individual I I -4 of TOU) . 
X03604 M12958 
X03S04.1 31:33070 

constant region; gamma -imrnunoglobul in; Ig heavy chain; 

immunoglobulin. 

human. 

Homo sapiens 

Eukaryota? Metazoa; Chordata; Craniata; Vertebrata; Euteleoscomi; 
Mammalia; Butheria; Primates; Catarrhini; Eominidae; Homo. 

1 (bases 1 to 2590) 

Huck,s w 7ort,P., Crawfora,D.H., bef ranc,M.P . and Lefranc,G. 
Sequence oE a human immunoglobulin gamma 3 heavy chain constant 
region gene: comparison with the other human C gamma genes 
Nucleic Acids Res. 14 (4) , 1779-1789 (1986) 
86148507 

2 (bases 3, to 204; 2202 to 2236) 

Takahashi,KT. , Ueda,S., Obata,M., Nikaido,T., Nakai,S. and 

Structure of human immunoglobulin gamma genes: implications for 
evolution of a gene family 
Cell 29 {2), 671-679 (19B2) 
83001943 

Location/Qualifiers 
1. .2590 

/organism- "Homo sapiens" - 
! db^jcref = " taxon ; 96 0 6 " 
23 

/note="T is missing in [2] " 
/citation- [2] 
80.. 82 

/note«"CGC is GCG in [2] " 
/citation^ [2] 
120 

/note-"C is T in [2] » 
/citations [2] 
135 

/note^T is C in [2] * 
/citation=[2] 

join (<215 . . 509, 901 . . 951, 1095 . . 1139 , 12 83 . . 1327 , 
1471. .1515, 1634 . .1963, 2061. . >2380) 
/codon_start-l 
/product- "C gamma 3 " 
/protein_id= B CAA27268 . 1" 
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/db_xref= M GI: 577056" 



/translation- "ASTKGPSVCTLA^ 

ALTSGVHTFPAVLQSSGLYSLSSVVTVPSSSLGTQTYTCOTNH^ 

PLGDTTHTCPRC PE PKS CDTPPPCPRC PB PKS CDT P PPCPRCPE PKS CDTP P PCPRCP 

APSLLGGPSVFLFPPKPKDTLMISRTPEVTCVVVDVSHEDP 

TKPRBEQYNSTFRWSVLTVLHQDWLW^ 

QVYTLPPSREEMTKNQVSLTCLVKGFYPSDIAVEWESSGQPENDTYNTTPPMLDSDGSF 



intron 


FLYSKLTVBKSRWQQC^IPSCSVMHEALHIJRFTQK 


510.. 900 




/note=" intron I" 


intron 


952.. 1094 




/not e=" intron II" 


intron 


1140.. 1282 


intron 


/note=" intron III" 


1328. .1470 




/note=" intron IV" 


intron 


1516. .1633 




/not e=" intron V" 


intron 


1964. .2060 




/note =" intron VI" 


misc_feature 


2484. .2489 


COUNT 541 


/note="pot. polyA signal" 


a 925 C 703 g 421 t 



1 agctttctgg ggcaggccag gcctgacttt ggctgggggc agggaggggg ctaaggtgac 
51 gcaggtggcg ccagccaggc gcacacccaa tgcccgtgag cccagacact ggaccctgcc 
121 tggaccctcg tggatagaca agaaccgagg ggcctctgcg ccctgggccc agctctgtcc 
181 cacaccgcag tcacatggcg ccatctctct tgcagcttcc accaagggcc catcggtctt 
241 ccccctggcg ccctgctcca ggagcacctc tgggggcaca gcggccctgg gctgcctggt 
301 caaggactac ttccccgaac cggtgacggt gtcgtggaac tcaggcgccc tgaccagcgg 
3 51 cgtgcacacc ttcccggctg tcctacagtc ctcaggactc tactccctca gcagcgtggt 
421 gaccgtgccc tccagcagct tgggcaccca gacctacacc tgcaacgtga atcacaagcc 
481 cagcaacacc aaggtggaca agagagttgg tgagaggcca gcgcagggag ggagggtgtc 
541 tgctggaagc caggctcagc cctcctgcct ggacgcatcc cggctgtgca gtcccagccc 
601 agggcagcaa ggcaggcccc gtctgactcc tcacccggag cctctgcccg ccccactcat 
661 gctcagggag agggtcttct ggctttttcc accaggctcc gggcaggcac aggctggatg 
721 cccctacGcc aggcccttca cacacagggg caggtgctgc gctcagagct gccaaaagcc 
7B1 atatccagga ggaccctgcc cctgacctaa gcccacccca aaggccaaac tctctactca 
841 ctcagctcag acaccttctc tcttcccaga tctgagtaac tcccaatctt ctctctgcag 
901 agctcaaaac cccacttggt gacacaactc acacatgccc acggtgccca ggtaagccag 
961 cccaggactc gccctccagc tcaaggcggg acaagagccc tagagtggcc tgagtccagg 
1021 gacaggcccc agcagggtgc tgacgcatcc acctccatcc cagatccccg taactcccaa 
1081 tcttctctct gcagagccca aatcttgtga cacacctccc ccgtgcccac ggtgcccagg 
1141 taagccagcc caggcctcac cctccagctc aaggcaggac aagagcccta gagtggcctg 
1201 agtccaggga caggccccag cagggtgctg acgcgtccac ctccatccca gatccccgta 
12S1 actcccaatc ttctctctgc agagcccaaa tcttgtgaca cacctccccc atgcccacgg 
1321 tgcccaggta agccagccca ggcctcgccc tccagctcaa ggcgggacaa gagccctaga 
1381 gtggcctgag tccagggaca ggccccagca gggtgetgac gcatccacct ccatcccaga 
1441 tccccgtaac tcccaatctt ctctctgcag agcccaaatc ttgtgacaca cctcccccgt 
1501 gcccaaggtg cccaggtaag ccagcccagg cctcgccctc cagctcaagg caggacaggt 
1561 gccctagagt ggcctgcatc cagggacagg tcccagtcgg gtgctgacac atctgcctcc 
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1621 

1681 

1741 

1801 

1361 

1921 

1981 

2041 

2101 

2161 

2221 

2281 

2341 

2401. 

2461 

2521 

2581 



atctcttcct 
cccaaggata 
agccacgaag 
gccaagacaa 
accgtcctgc 
gccctcccag 
tgagggccac 
gccaacctct 
ccgggaggag 
cagcgacatc 
gcctcccatg 
gagcaggtgg 
ccgcttcacg 
cccccgctcc 
acttcccggg 
ctgtgatggt 
agagcgggtc 



cagcacctga 
cccttatgat 
accccgaggt 
agccgcggga 
accaggactg 
cccccatcga 
atggacagag 
gtccctacag 
atgaccaaga 
gccgtggagt 
ctggactccg 
cagcagggga 
cagaagagcc 
ccgggctctc 
cacccagcat 
tctttccacg 



actcctggga 
ttcccggacc 
ccagttcaag 
ggagcagtac 
gctgaacggc 
gaaaaccatc 
gccagcttga 
gacagccccg 
accaggtcag 
gggagagcag 
acggctccfct 
acatcttctc 
tctccctgtc 

ggggtcgcgc 

ggaaataaag 
ggtcaggccg 



ggaccgtcag 
cctgaggtca 
tggtacgtgg 
aacagcacgt 
aaggagtaca 
tccaaaacca 
cccaccctct 
agaaccacag 
cctgacctgc 
cgggcagccg 
cttcctctac 
atgctccgtg 
tccgggtaaa 
gaggatgctt 
cacccagcgc 
agtctgaggc 



tcttcctctt ccccccaaaa 



cgtgcgtggt 
acggcgtgga 
tccgtgtggt 
agtgcaaggt 
aaggtgggac 
gccctgggag 
gtgtacaccc 
ctggtcaaag 
gagaacaact 
agcaagctca 
atgcatgagg 
tgagtgcgac 
ggcacgtacc 
tgccctgggc 
ctgagtgaca 



ggtggacgtg 
ggtgcataat 
cagcgtcctc 
ctccaacaaa 
ccgcggggta 
tgaccgctgt 
tgcccccatc 
gcttctaccc 
acaacaccac 
ccgtggacaa 
ctctgcacaa 
agccggcaag 
ccgtgtacat 
ccctgtgaga 
tgagggaggc 



VI. HUMAN IG GAMMA-4 CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

>sp|P0186l|GC4_HUMAN IG GAMMA-4 CHAIN C REGION - Homo sapiens (Human) . 
10 20 ' 30 40 50 60 

I I I I I I 

ASTKGPSVFP LAPCSRSTSE STAALGCLVK DYFPEPVWS WNSGALTSGV HTFPAVLQSS 
70 80 90 100 110 120 

I I ' I I I I 

GLYSLS3WT VPSSSLGTKT YTCNVDHKPS NTKVDKRVES KYGPPCPSCP APEFL*GGPSV 

' 130 140 150 160 170 180 

II I I I I 

FLFPPKPKDT LMISRTPEVT CVWDVSQED PEVQFNWYVD GVEVHNAKTK PREEQFNSTY 

190 200 210 220 230 240 

II I I I I 

RWSVLTVLH QDWLNGKEYK CKVSNKGLPS SIEKTISKAK GQPREPQVYT LPPSQEEMTK 

250 260 270 280 290 300 

I I ! I I i 

NQVSLTCLVK GFYPSDIAVE WESNGQPENN YKTTPPVLDS DGSFFLYSRL TVDKSRWQEG 
310 320 327 

I I I 

NVFSCSVMHE ALHNHYTQKS LSLSLGK 
CODING SEQUENCE 

g -1 

cttccaccaa gggcccatcc gtcttccccc tggcgccctg ctccaggagc acctccgaga SO 

gcacagccgc cctgggctgc ctggtcaagg actacttccc cgaaccggtg acggtgtcgt 120 

ggaactcagg cgccctgacc agcggcgtgc acaccttccc ggctgtccta cagtcctcag 1B0 

gactctactc cctcagcagc gtggtgaccg tgccctccag cagcttgggc acgaagacct 240 

acacctgcaa cgtagatcac aagcccagca acaccaaggt ggacaagaga gttgagtcca 300 

aatatggtcc cccatgccca tcatgcccag cacctgagtt cctgggggga ccatcagtct 3 60 

tcctgttccc cccaaaaccc aaggacactc tcatgatctc ccggacccct gaggtcacgt 420 
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gcgtggtggt 
gcgtggaggt 
gtgtggtcag 
gcaaggtctc 
ggcagccccg 
accaggtcag 
gggagagcaa 
acggctcctt 
atgtcttctc 
tctccctgtc 



ggacgtgagc 
gcataatgcc 
cgtcctcacc 
caacaaaggc 
agagccacag 
cctgacctgc 
tgggcagccg 
cttcctctac 
atgctccgtg 
tctgggtaaa 



caggaagacc 
aagacaaagc 
gtcctgcacc 
ctcccgtcct 
gtgtacaccc 
ctggtcaaag 
gagaacaact 
agcaggctaa 
atgcatgagg 
tga 



ccgaggtcca 
cgcgggagga 
aggactggct 
ccatcgagaa 
tgcccccatc 
gcttctaccc 
acaagaccac 
ccgtggacaa 
ctctgcacaa 



gttcaactgg 
gcagttcaac 
gaacggcaag 
aaccatctcc 
ccaggaggag 
cagcgacatc 
gcctcccgtg 
gagcaggtgg 
ccactacaca 



tacgtggatg 
agcacgtacc 
gagtacaagt 
aaagccaaag 
atgaccaaga 
gccgtggagt 
ctggactccg 
caggagggga 
cagaagagcc 



480 
540 
600 
660 
720 
780 
840 
900 
960 
983 



GenBanJc 

K01316. Human Ig germline 



LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUB MED 
REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
PUBMED 
COMMENT 



FEATURES 

source 



HUMIGCD2 2028 bp DNA PRI ll-APR-2001 

Human Ig germline H-chain G-E-A region Br gamma-4 constant region, 
3' end. 
K0131S 

K01316.1 GI:184751 
human. 

Homo sapiens 

Eukaryota; Metazoa; Chorda ta; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 2028) 

Ellison, J., Buxbaura,J. and Hood,L. 

Nucleotide sequence of a human immunoglobulin C gamma 4 gene 

DNA 1 (1), 11-18 (1981) 

83157104 

2 (bases 475 to 1069; 1180 to 1331; 1432 to 1655) 

Takahashi , N . , Ueda,S., Obata,M., Nikaido,T., Nakai,S. and Hon jo, T. 

Structure of human immunoglobulin gamma genes: implications for 

evolution of a gene family 

Cell 29 .(2), 671-679 (1982) 

83001943 

6811139 

3 (bases 894 to 1106) 
Krawinkel,U. and Rabbitts,T .H. 

Comparison of the hinge- coding segments in human immunoglobulin 
gamma heavy chain gene 3 and the linkage of the gamma 2 and gamma 4 
subclass genes 

EMBO J. 1 <4), 403-407 (1982) 

84235992 

6329676 

4 (bases 1 to 2028) 
Ellison, J. and Hood,L. 

Linkage and sequence homology of two human immunoglobulin gamma 
heavy chain constant region genes 

Proc. Natl. Acad. Sci. U.S.A. 79 (6) , 1984-1988 (1982) 

82197621 

6804948 

[1] reports that the human C-gamma-4 gene is equally homologous to 
the mouse gamma- 1, gamma- 2a, and gamma- 2b genes (about 75%) . [33 
also reports partial sequences for human gamma-2, gamma-3, and a 
gamma pseudogene. [2] presents the garama-1, gamma-2, gamma-3, and 
pseudo-gamma hinge regions . 

This entry is part of a multigene region (region B) , which includes 
the gamma-2, gamma- 4, epsilon-1, and alpha-2 genes. See segment 1 
for more comments. 
Complete source information i 

Human fetal liver DNA, library of T. Maniatis [3] and Lawn et al 
[1] , 12] ; clones 24B [1] , lambda-HG4.1 [3], and Ig-gamma-4-2 {2]. 

Location/ Qualifiers 

1..2028 



23/65 



WO 01/83529 



PCT/US01/13932 



FIGURE 7 R 



gene 
intron 

CDS 



mbron 

exon 

intron 

exon 

intron 

exon 



BASE COUNT 421 
ORIGIN 

1 agctttctgg 
51 gcaggtggcg 
121 tggaccatcg 
181 cacaccgcgg 
241 ccccctggcg 
301 caaggactac 
3 SI cgtgcacacc 
421 gaccgtgccc 
431 cagcaacacc 
541 tgctggaagc 
601 agggcagcaa 
6 Si tgctcaggga 
721 cccctacccc 
781 tatccgggag 
841 tcagctcaga 
901 gtccaaatat 
951 cagctcaagg 
1021 gtgctgacgc 
1031 tcttcctgtt 
1141 cgtgcgtggt 
1201 atggcgtgga 
12 SI accgtgtggt 
1321 agtgcaaggt 
1331' aaggtgggac 



/organism-* "Homo sapiens' 1 
/db_xref =* " taxon : 9 6 0 6 " 
/raap='■14q^2.33 , ' 
/germline 

<1. .2028 / 

/gene= "igH" 

/notes" IGHG4" 

<1. .215 

/geneB"igH" 

7note="gamma-4 intron J-C 

join(<216. .509,900. .935,1054. .1383,1481. .1803) 

/genesTgH" 

/codon_start=3 

/product=" immunoglobulin gamma -4 heavy chain" 
/protein_id= n AAB593 94 . 1" 
/dJb_xref« »GI r 184 759 ff 

/trans lat ion= "STKGPSVFPLAPCSRSTSESTAALQCI»VKDYPPEPVXVSWWSGA 
LTSGVHTFTAVLQSSGL YS LS S WTVPS SSLGTfCTY^ KYG 
PPCPSCPAPEFLGGPSVFLFPPKPKDTIMISRTPEK/^^ 

VEVHWAKTKPREEQFNS T YRWSVLrVLHQDWLIJGKKYKCKVSMKGLPSSI EKTI SKA 
KGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEVJESKGQPENNYKTTPPV 

21€ . .509 
/gene-" IgH" 
/note="G00-119-340° 
510.. 899 
/gene«"IgH" 
900.. 935 
/gene=»«igH" 
936,. 1053 
/gene» H IgH" 
1054.. 1383 
/gene^IgH" 
1384.. 1480 
/gene^IgH" 
1481.. 1803 
/gene= , MgH ,, 
a 709 c 567 g 



ggcaggccgg 
ccagccaggt 
cggatagaca 
tcacatggca 
ccctgctcca 
ttccccgaac 
ttcccggctg 
tccagcagct 
aaggtggaca 
caggctcagc 
ggcatgcccc 
gagggtcttc 
aggccctgcg 
gaccctgccc 
caccttctct 
ggtcccccat 
cgggacaggt 
atccacctcc 
ccccccaaaa 
ggtggacgtg 
ggtgcataat 
cagcgtcotc 
ctccaacaaa 
ccacggggtg 



gcctgacttt 
gcacacccaa 
agaaccgagg 
ccacctctct 
ggagcacctc 
cggtgacggt 
tcctacagtc 
tgggcacgaa 
agagagttgg 
cctcctgcct 
atctgtctcc 
tggatttttc 
catacagggc 
ctgacctaag 
cctcccagat 
gcccatcatg 
gccctagagt 
atctcttcct 
cccaaggaca 
agccaggaag 
gccaagacaa 
accgtcctgc 
ggcctcccgt 
cgagggccac 



331 t 

ggctgggggc 
tgcccatgag 
ggcctctgcg 
tgcagcttcc 
cgagagcaca 
gtcgtggaac 
ctcaggactc 
gacctacacc 
tgagaggcca 
ggacgcaccc 
tcacccggag 
caccaggctc 
aggtgctgcg 
cccaccccaa 
ctgagtaact 
cccaggtaag 
agcctgcatc 
cagcacctga 
ctctcatgat 
accccgaggt 
agccgcggga 
accaggactg 
cctccatcga 
acggacagag 



agggaggggg 
cccagacact 
ccctgggccc 
•accaagggcc 
gccgccctgg 
tcaggcgccc 
tactccctca 
'tgcaacgtag 
gcacagggag 
cggctgtgca 
gcctctgacc 
ccggcaccac 
ctcagacctg 
aggccaaact 
cccaatcttc 
ccaacccagg 
cagggacagg 
gttcctgggg 
ctcccggacc 
ccagttcaac 
ggagcagttc 
gctgaacggc 
gaaaaccatc 
gccagctcgg 



ctaaggtgac 
ggaccctgca 
agctctgtcc 
catccgtctt 
gctgcctggt 
tgaccagcgg 
gcagcgtggt 
atcacaagcc 
9gagggtgtc 
gccccagccc 
accccactca 
aggctggatg 
ccaagagcca 
ctccactccc 
tctctgcaga 
cctcgccctc 
ccccagccgg 
ggaccatcag 
cctgaggtca 
tggtacgtgg 
aacagcacgt 
aaggagtaca 
tccaaagcca 
cccaccctct 
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1441 gccctgggag tgaccgctgt gccaacctct gtccctacag ggcagccccg agagccacag 
1501 gtgtacaccc tgcccccatc ccaggaggag atgaccaaga accaggtcag cctgacctgc 

1561 ctggtcaaag gcttctaccc cagcgacatc gccgtggagt gggagagcaa tgggcagccg 

1621 gagaacaacfc acaagaccac gcctcccgtg ctggactccg acggctcctt cttcctctac 

1681 agcaggctaa ccgtggacaa gagcaggtgg caggagggga atgtcttctc atgctccgtg 

1741 atgcatgagg ctctgcacaa ccactacaca cagaagagcc tctccctgtc tctgggtaaa 

1801 tgagtgccag ggccggcaag cccccgctcc ccgggctctc ggggtcgcgc gaggatgctt 

1861 ggcacgtacc ccgtctacat acttcccagg cacccagcat ggaaataaag cacccaccac 

1921 tgccctgggc ccctgtgaga ctgtgatggt tctttccacg ggtcaggccg agtctgaggc 

1981 ctgagtgaca tgagggaggc agagcgggtc ccactgtccc cacactgg 

VH. HUMAN IG DELTA CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

>sp)P01380|DTC_HUMAN IG DELTA CHAIN C REGION - Homo sapiens (Human) . 

40 50 SO 



100 110 120 

I I I 



10 


20 


30 


1 

APTKAPDVFP 


1 

IISGCRHPKD 


1 

JTSPWIiACLI 


70 


80 


90 


1 

DSYYMTSSQL 


1 

STPLQQWRQG 


1 

SYKCWQHTA 


130 


140 


150 


1 

SLAK&TTAPA 


I 

TTRNTGRGGE 


1 

EKKKEKEKEE 


190 


200 


210 


i 

WLRDKATFTC 


1 

FWGSDLKDA 


1 

HLTWEVAGKV 


250 


260 


270 


1 

WNAGTSVTCT 


1 

LNHPSLPPQR 


1 

liMALREPAAQ 


■ 310 
1 


320 


330 
1 

APARPPPQPG 


PPNILLMWLE 


1 

DQREVNTSGF 


370 


380 




I. 

DSRTLLNASR 


i 

SLEVSYVTDH 


GPM 



160 170 180 



220 230 240 



280 290 300 

I I I 



340 350 ' 360 

I 'I I 



GenBank 

K02876. Human germline IgD. [gi : 184766} PubMed, Protein, Related Sequences, Taxonomy, 
OMIM, LinkOut 1 

LOCUS HUMIGCH02 300 bp DNA PRI -08 -NOV- 1994 

DEFINITION Human germline igD-chain gene, C-region, first hinge domain. 
ACCESSION K02B76 
VERSION K02B76.1 GI: 184766 

KEYWORDS C-region; germline,-. hinge exon,- immunoglobulin heavy chain ,* 

immunoglobulin- delta . 
SOURCE Homo sapiens Undividual_isolate Chronic lymphocytic leukemia (CLL) 

patient) DNA. 
ORGANISM Homo sapiens 

BuJcaryota; Metazoa; Chordata,- Craniata; VerteJbrata; Euteleostomi; 
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REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 

COMMENT 

FEATURES 

source 



Catarrhini; Homihidae; Homo. 



mtron 



nitron 



gene 



CDS 



CDS 



Mammalia; Eutheria; Primates; 
1 (bases l to 300) 
White, M. B., Shen,A.L., Word,c.a., Tucker, P.w. and Blattner,F.R. 
Human immunoglobulin D: genomic sequence of the delta heavy cha<n 
Science 228 (4700), 733-737 (1985) 
85192522 
See segment 1. 

Location/Qualifiers 

1 . .300 

/organism^ "Homo sapiens" 

/iBolate=»Chronic lymphocytic leukemia (CLL) patient" 

/db_xref = n t axon : 9 6 06 " 

/maps" 1^32.33" 

/eel l_type= n lymphocyte " 

/germline 

<1 . .151 

/gehe=»IGHD" 

/note=»G00-120-084 tI 

/number=l 

101.. 202 

/partial 

/gene=» n IGHD" 

/note-"hinge^l, domain; GOO-120-084" 

/number«2 

203 . . >300 

/gene=»IGHD M 

/note="GOO-120-084" 

/number=2 

join(K02875 . 1:1. .495,1. .300, K02877. 1:1. .300, 
K02878.1:!. .500,K02879.1:1. .500,K02880.1:1. .100, 
K02881.1:l. .200,K02B82.1:1. .52) 
/gene= w I3HD M 

join(K02875. 1:101. .403,101. . 202 , K02877 . 1 ; 101 . .172 

K02878 . 1 :1G1 > .424 , K02879 . 1 : 101 . .424,K02881.1:25. 182 

K02882. 1.-44. .52) 

/partial 

/gene="IGHD" 

/not e»? membrane bound form" 
/codon — start=3 . 

/product = " immunoglobulin del ta- chain" 
/protein_id= "AAA52 771 . 1 » 
/db_xr ef '= " GI : 4 9 5 8 7 2 " 
/db_xref = "GD3 : GO 0-120 -084 " 

/ 1 ran sla t ion = » ptkapdvfpi isgcrhpkdnspvvlacli tgyhptsvtvtwymg 

TQSQPQRTFPEIQPJUDSYYMTSSQLSTPI^QWRQGEYKCVVQHTASKSKKEIFRWPES 
PKAQASSVPTAQPQAEGSliAKATTAPATTWTGRGGEEK^ 

PSHTOPLGVYLLTPAVQDLWtjRrKATFTCFWGSDIJa^LTWBVAGKVPTGGVEEGL 
LERHSNGSQSQHSRLTLPRSLWNAGTSVTCTLNHPSLPPQRLr4ALREPAAQAPVKbSL 
NLLAS SDP PEAAS WLLCEV3 GFSP PN I LLMWLEDQREVNTSGFAPARPP PQPRSTTFW 

AWSVLRVPAPPSPQPATYTCWSHEDSRTLI^ASRSLEVSYIiAMTPLIPQSKDElJSDD 

YTTFDDVGSLWTTLS TFVALFI LTLLYSGIVTF I KVK » 

join(K02875. 1:101. .403,101. . 202 , K02877 . 1:101. .172, 

K0287B. 1:101. .424, K02879. 1:101. . 424 , K02880 . 1 : 25 . . 53 ) 

/partial 

/gene» "IGHD" 

/notes" secreted form" 

/codon_start=3 

/products'* immunoglobulin delta-chain" 
/pro t ein_id= " AAAS 2 7 70 . 1 " 
/ db^xref s » GI : 4 95 8 7 1 " 
/db__xref = "GDB :<300- 120-084 ■ 

/ translation* " PTKAPDVFPI ISGC^PKDNSPVVIiACLlTGYHPTSAm/TWYMG 
TQ SQPQRTFPE I QRRDS YYMTSSQLS TPLQQWRQGEYKCVVQHTAS KSKKE I FRWPES 
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U 



PKAQASSVPTAQPQAEGSLAMTTAPATTROTGRGGEEKK^ 

PSHTQPLGVYLLTPAVQDLWLRDKATFTCFVVGSDliKDAHLTWEVAGKVPTGGVEEGL 
LEEi^NGSQSQHSRLTLPRSLWNAGTSVTCTLNHPSIxPPQRL^^ 
KLLASSDPPBAASWLLCEVSGFSPPNILLMWLBI)QREVKTSGFAPARPPPQPRST?FW 
AWSVLRVPAPPSPQPATYTCWSHEDSRTLLNASRSLEVSTVTDHGPMK" 



BASE COUNT 5? a 133 C 52 g 56 t 

ORIGIN About 300 bp after segment 1; 118 bp upstream of StuI site. 

1 taggctgcct gtgcccccca cctgcctgtc cacaacccag cctctggtac atccatgccc 
Si tctgccctaa gcctcacctg cacttttcct tggatttcag agtctccaaa ggcacaggcc 
121 tcctccgtgc ccactgcaca accccaagca gagggcagcc tcgccaaggc aaccacagcc 
181 ccagccacca cccgtaacac aggtgagaag ccccttccct gcacactcca cccccaccca 
241 cctgctcatt cctcagccgc ctcctccagg cagcccttca taactccttg tctgagtctc 

K02877. Human Ig germline 
OMIM, LinkOut 



. tgi: 184767] PubMed, Protein, Related Sequences, Taxonomy, 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 

COMMENT 

FEATURES 

source 



mtron 



gene 



CDS 



HUMIGCH03 300 bp DNA PRI 08-NOV-1994 

Human Ig germline delta H-chain C-region gene, second hinge domain 
(CLL lymphocyte) . 
K02877 

Kp2877.1 GI: 184767 

C-region; germline; hinge exon; immunoglobulin heavy chain,-, 
immunoglobulin- delta . 

Homo sapiens { individual^ solate Chronic lymphocytic leukemia (CLL) 
patient) DNA. 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata ; Euteleostomi j 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 300) 

White, M.B., Shen,A.L., Word/C.J., Tucker,P.W. and Blattner, F.R. 

Human immunoglobulin D: genomic sequence of the delta heavy chain 

Science 228 (4700) , 733-737 (1985) 

85192522 

See segment 1 . 

Location/Qualifiers 

1. .300 

/organi sm= " Homo s api ens " 

/isolate= "Chronic lymphocytic leukemia (CLL) patient" 
/db_xre £ = n t axon : 9 6 0 6 ■' 
. /map=»14q32.33 M 

/eel l_type = " lymphocyt e " 

/germline 

<1. .100 

/gene= tt IGHD" 
/note= "G00-120-084 * 
/number= 2 
101.. 172 
/gene="IGHD" 

/note= M hinge-2 domain; GO0-120-O84; putative" 

/nutnber=3 

173 . . >300 

/gene^lGHD" 

/note= B G00-120-084" 

/numbers 

join (K02875. 1:1. .495 , K02876 . 1 : 1 . .300,1. .300, 
K02878.1:l. . 500,K02879 . 1 :1 . .500,K02880.1 :1 . .100, 
K02881.1;!. . 200, K02882 . 1 :1 . .52) 
/gene=°IGHD" 

join(K02875. 1:101. .403, KD2876 . 1 : 101 . .202,101. .172, 
K02S78. 1:101. .424, K02879 -1 :101 . .424, K02881. 1:25 - .182, 
K02882 .1:44. .52) 
/partial 
/gene« "IGHD" 
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/note= n membrane bound form" 
/codon_jstart=3 

/product^" immunoglobulin delta-chain" 
/protein_id=* "AAA52771 .1" 
/db_xref = "GI : 4958 72 " 
/db_xr ef = » GDB : GO 0 - 1 2 0 - 0 84 " 

/translation* «■ PTKAPDVPPIISGCRHPKDNSPVVIiACLITGYHPTSVTVTWYMG 

TQSQPQRTFPEIQRRDSYYMTSSQLSTPLQQWRQGEYKCWQHTASKSKKEIPRWPBS 

PKAQASSVPTAQPQAEGSLAKAITAPATTRNTGRGGEE^^ 

PSHTQPLGVYLLTPAVQDLWLRDKATPTCFWGSDLKDAI^ 

LEPJISNGSQSQHSRLTLPRSLWNAGTSVTCTLNHPSLPPQR 

NLIiASSDPPI^SWLLCEVSGFSPPNILLAJWLBI^REVNTSGFAPARPPPQPRSTTPW 
AWSVLRVPAPPSPQPATYTCWSHEI)SRTLia^ 
CTTFDDVGSLWTTLSTFVALFILTLLYSGIVTFIKVX" 
^ joinlK02875. 1:101. .403, K02876. 1:101. . 202 , 101 . . 172, 

K02878. 1:101. .424,10)2979.1:101. . 424 , K02880 . 1 : 25 . . 53 ) 

/partial 

/gene=' l IGHP n 

/note«" secreted form" 

/codon^start=*3 

/product-" immunoglobulin delta-chain" 
/prot ein_id- "AAA5 2770.1" 
/db_xr e£- " GI : 4 9 B 8 7 1 ■ 
/db_xxef m "GDB ; GOO - 120 - 084 " 

/translation^ v PTKAPDVFPI I SGCRHPKDNS PWLACLI TGYHPTS VTVTWYMG 
TQSQPQRTPPEIQRRDSYYMTSSQLSTPIiQQWRQGEYKCWQHTASKSKKEIFRWPES 
PKAQAS3VPTAQFQJU2GSLAKA1TAP 

PSHTQPLGVYLLTPAVQDLWLRDKATFTCFWGSDLKDAKLTVJEVAGKVPTGGVEEGL 
LERHSUGSQSQHS RLTLPRS L>^AGTSVTCTLNHPS LPPQRLMALRE PAAQAP VKLS L 
NLIASSDPPEAASWLLCEVSGF5PPNILLMVJLEDQREVNTSGFAPARPPPQPRSTTFW 
AMSVLR^APPSPQPATYTCVVSHEDSRTLL^SRSLEVSYVTDHGPMK 1 ' 
BASE COUNT 102 a 52 c 70 g 76 t 

ORIGIN About 1.85 kb after segment 2. 

1 gtcattagct ggatttagcc attccacaat gtacacatat ttcaaacatt gtgttgtata 
61 tgataaacat gtataatttt tgtcaattaa aaatttttag gaagaggagg agaagagaag 
121 aagaaggaga aggagaaaga ggaacaagaa gagagagaga caaagacacc aggttttttc 
181 tgacccctgg gctatcaaaa cacctattgc ccaataacta gttggccgtt ggtgccctaa 
241 actattgaag cgattgctgt tatgtggatg ggccccggac acttagaaac tcgtgacccc 

K02B7B. Human germline IgD. [gi : 184768] PubMed, Protein, Related Sequences, Taxonomy 
OMIM, LinkOut 

LOCUS HUMIGCH04 500 bp DNA PRI 08-NOV-1994 

DEFINITION Human germline IgD chain gene, c- region, C-delta-2 domain 
ACCESSION K02878 
VERSION K02878.1 GI: 184768 

KEYWORDS C-region; germline; immunoglobulin heavy chain; 

immunoglobulin-delta . 
SOURCE Homo sapiens £individual_isolate Chronic lymphocytic leukemia <CLL) 

patient) dna. 
ORGANISM Homo sapiens 

Eukaryota ; Metazba; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; " Homo. 
REFERENCE 1 (bases 1 to 500) 

AUTHORS White, M.B., Shen,A.L. , Word,C.J., TucJcer,P.W. and Blattner,F.R. 
TITLE Human immunoglobulin D: genomic sequence of the delta heavy chain 
JOURNAL Science 228 (4700) , 733-737 (1935) 
MEDLINE 85192522 
COMMENT See segment 1. 
FEATURES Location/Qualifiers 
source i . .500 

/organism- "Homo sapiens" 

/isolate- "Chronic lymphocytic leukemia (CLL) patient" 
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intron 



exon 



intron 



intron 



gene 



CDS 



/db_xr ef = t axon :9$0S n 
/raap=' , 14g32 .33" 
/ eel 1_ t ype= n 1 yraphocy t e " 
■ /germline 
<1..100 
/gene="XGHD" 
/note=«G00-120-084" 
/number«3 
101.. 424 
/gene= n IGHD n 

/n6te= n C-delta-2 domain; GOO-120-084" 
/numbers 
425..>500 
/gene="IGHD» 
/note= "GOO-120-084 a 
/number>=4 
425..>500 
/gene="IGHD n 
/note» rt lgD-s intron D" 

join[K02875.1:l. .495 , XD2876 . 1 : 1 . . 300, K02877 . 1 r 1 . .300, 
1. .500,K02879.1:1. . 50D, K02880 . 1 : 1 . . 100, K02881 . 1 : 1 .200 
K02882.1:!. .52) 
/gene«"IGHD n 

join ( K02875 .1 : 101 .. . 4 03 , K02876 . 1 ; 101 . . 202 , 
K02877. 1:101.. 172,101. . 424 , K02879 . 1 : 101 . .424, 
K02881.1:25. . 182 # K02832 .1 : 44 . . 52 ) 
/partial 
/gene="IGHD n 

/not e= "membrane bound form n 
/codon_start=*3 

/products "immunoglobulin delta-chain" 
/protein_id="AAA52771. 1" 
/db_xref= ,r GI : 495872 * 
/db__xref = "GDB : GO 0 - 120 - 084 " 

/ translation= " ptk^dvfpiisgcrhpkdnspvvlaclitgyhptsvtvtw™g 

T0SQPQRTPPEI0RRDSY7MTSSQLSTPLQQWRQGEYKCW0HTASKSIOCBIFRWPBS 
PKAQAS S VPTAQ PQAEGS IJUCATTAPATTRNTGRGGEEKKKE KE KEEQEERETKTPE C 
PSHTQPLGVYLLTPAVQDLWLRDKATFTCFWGSDL^ 
LERHSNGSQSQHSRLTLPRSLt^AGTSVTCTliNHPS^ 

NLI^SDPPEAASWLLCEVSGFSPPNILIiMWLEDQREVNTSGFAPARPPPQPRSTTFW 
AWSVLRVPAPPSPQPATYTCWSHEra^ 
YTTFDDVtSSLWTTLSTFVAlrFI LTLLYSGI VTFI KVK 1 ' 
join CK0287S .1 : 101 . 403,K02876 . 1:101 . . 202 , 
K02877. 1:101. .172,101. . 424 , K02879 . 1 : 101 . .424 
K02880.1x25. .53) 
/partial 
/gene=»IGHD" 
/no te=" secreted form* 
/codon_start=3 

/product = immunogl obul in del t a - cha in" 
/protein_id=»AAA52770. l» 
/db_ocref s »GI : 495871 » 
/dJb__xref=s M GDB: GOO -12 0-084" 

/translations "PTKAPDV?PIISGCRHPKDNSPVVLACLITGYHPTS\n7VTWYMG 

TQSQPQRTFPEIQRRDSYYMTSSQLSTPLQQWRQGEYKCWOHTASKSKKEIFRWPES 

PKAQASSVPTAQPQAEGSLAXAITAPATTRNTGRGGEEKKKEKEKEEQEERETKTPEC 

PSHTQPIX3VYLLTPAVQDLWI^DKATFrc 

LERHSNGSQSQHSRLTLPRSLIWAGTSVTCT^ 

NLIASSDPPEAASWLLCEVSGF9PPNILLMMLEDQREVNTSGFAPARPPPQPRSTTFW 
AWSVLRVPAPPSPQPATYTCVV3HEDSRTLLNASRSLEVSYVTDHGPMK" 
BASE COUNT 93 a 171 c 157 g 79 t 

ORIGIN About 450 bp after segment 3; 131 bp upstream of AccI site. 

1 gaagctgggg agaggagagc acagtggtta agtcagtccc tgcagcccaa ctgctcccga 



CDS 
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61 aggtccggcc 
121 cctcttggcg 
181 accttcacct 
241 gctgggaagg 
301 tcccagagcc 
361 gtcacctgca 
421 cccggtgagc 
481 gcccgcccca 



acagctgctc 
tctacctgct 
gcttcgtggt 
tccccacagg 
agcacagccg 
caccgaacca 
ctggctccca 
gggccatgac 



tcgtttgctc 
aacccctgca 
gggcagtgac 
gggcgtggag 
tctgaccctg 
tcccagcctc 
ggtggggaga 



tcccctgcag 
gtgcaggacc 
ctgaaggatg 
gaagggctgc 
cecaggtcct 
ccaccccaga 

cgagggtgcc 



agtgtccgag 
tgtggctccg 
ctcacctgac 
tggagcggca 
tgtggaacgc 
ggttgatggc 
cacagcctgc 



ccacacccag 
ggacaaagcc 
ctgggaggtg 
cagcaacggc 
ggggacctcc 
gctgagagaa 
tgacccctac 



K02879. Human Ig germline 
OMIM, LinkOut 



. . [gi:184769] PubMed, Protein, Related Seguences, Taxonomy, 



LOCUS 
DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 

COMMENT 

FEATURES 

source 



mtron 



intron 



gene 



CDS 



HUMIGCH05 . 500 bp DNA pri 08-NOV-1994 

Human Ig germline delta E-chain C-region gene, C-delta-3 domain 
(CLL lymphocyte) . 
K0287? 

K02879.1 GI: 184769 

C- region; germline; immunoglobulin heavy chain; 
immunoglobulin- delta. 

Homo sapiens (individual_isolate Chronic lymphocytic leukemia (CLL) 
patient) DNA. 
Homo sapiens 

Eukaryota; Metazoa; Chordata, Craniata; Vertebrafca, Euteleoetomi , 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 {bases 1 to 500) 

White, M.B., Shen,A.L., Word,C.J., Tucker, P. W. and Blattner,F.R. 

Human immunoglobulin D: genomic sequence of the delta heavy chain 

Science 228 (4700), 733-737 (1985) 

85192522 

See segment 1. 

Location/Qualifiers 

1- .500 

/organism= H Homp sapiens" 

/isolate=»Chronic lymphocytic leukemia (CLL) patient" 

/dbxre " taxori : 96 0 6 " 

/map=''14q32.33" 

/cell_type= "lymphocyte" 

/germline 

<1..10d 

/gene="IGHD" 

/note="G00-120-084 " 

/numbers 

101. .424 

/gene="IGHD" 

/note=»C^delta-3 domain; G00^120-084; putative" 

/number^ 5 

425. .>500 

/gene="lGHD n 

/note= r, G00-120^084 1 ' 

/number= 5 

join (K02875. 1:1. .495, K02876 . 1 :1 . . 300,KC2877 . 1 : 1 . . 300, 

K02878. 1:1. .500,1- 500^02880.1:1. .100, K028B1. 1:1. .200 
K02882.1:l. .52) 

/gene= ,, iGHD B 

join (K02875 . 1: 101 . . 403, K02876 . 1 : 101 . .202 , 

K02877. 1:101.. 172, K02B78. 1:101. .424,101. .424, 

K02881. 1:25. .182, K02382. 1:44. .52) 

/partial 

/gene^lGHD" 

/no te= "membrane bound form" 
/codon^start-3 

/product= "immunoglobulin delta-chain" 
/protein_id«"AAA5277i .1" 
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/db_xref« M GI: 495872° 
/db_xre f = " GDB : GO 0 - 1 2 0 - 0 B 4 ° 

/translation^ "PTKAPDVFPI ISGCRHPKDNSPVVLACLITGYHPTSVTVTWYMG 
TQSQPQRTFPEIQRRDSYYPH'SSQLSTPIjQQWRQGEYKCW 

PKAQAS S VPTAQPQAEGS LAKATTAP ATTRNTGRGGEEKKKBKEKEEQEERETKTPE C 
PSHTQPLGVYIjLTPAVQDLWLRDKATFTCFWGSDLKDAKLT^^ 

NLLAS SDP PE AASWLLCE VSGFS PPNI LLMWLEDQ GFAP ARPPPQPRSTTFW 

AWSVLRVPAPPSPQPATYTCWSHEDSRTLLNASRSLEVSYLAMTP^^ 
YTTFDDVGSLWTTLSTFVALFILTLLYSGIVTFIKVK" 
GDS join ( K02875 . 1 : 101 ...403 , K02876 .1:101.. 202 , 

K02877. 1:101. .172,K02878. 1:101. .424, 101 . .424, 

K02880.1:25. .53) 

/partial 

/geiie="IC3HD" 

/note= n secreted form" 

/codon_start=3 

/product=" immunoglobulin delta-chain" 
/protein_id« "AAA52 770 . 1 » , 
/db_xref ~ " GI : 4 9 5 8 7 1 " 
/db_xref * " GDB : GO 0 - 12 0 - 08 4 " 

/translation^ •> PTKAPDVFPIISGCRHPKDNSPVVIACLiTGYHPTS 

tqs qpqrtfpei qrrds yymtssql»stpi/qqwrqge ykcwqhtas ks kkei frwpe s 
pkaqassvptaqpo^gsiakattapattr^ 

ps htqplgvylltpavqdlwlrdkatftc fwgsdlkdahltvievag kvptggveegl 
lerhsngsqsqhsrltlprsl17nagtsvtctlkhpslppqrlmalrepaaqapvklsl 
nliassdppeaaswlix:evsgfsppnillmwledqrevntsgfaparpppqprstt^ 
awsvi.rvpaf pspqpatytcwshedsrtlwjasrslevsyvtdhg^mk " 

BASE COUNT 85 a 188 c 145 g 92 t 

ORIGIN About 150 bp after segment 4; 118 bp upstream of Hindlll site. 

1 ccacaggaaa ggagaaggga ggcaccacac cctggccggc cccacttctc tcccagtgcc 
51 cccgtggcca gagcctgaca gcccccccac ctccccgcag ctgcgcaggc acccgtcaag 
121 ctttctctga acctgctggc ctcgtctgac cctcccgagg cggcctcgtg gctcctgtgt 
131 gaggtgtctg gcttctcgcc ccccaacatc ctcctgatgt ggctggagga ccagcgtgag 
241 gtgaacactt ctgggtttgc ccccgcacgc ccccctccac agcccaggag caccacgttc 
301 tgggcctgga gtgtgctgcg tgtcccagcc ccgcccagcc ctcagccagc cacctacacg 
351 tgtgtggtca gccacgagga ctcccggact ctgctcaacg ccagccggag cctagaagtc 
4 21 agctgtgagt cacccccagg ccagggttgg gacggggact ctgagggggg ccataaggag 
431 ctggaatcca tactaggcag 

K01311. Human IgD germline. . . [gi:184716] PubMed, Protein, Taxonomy, OMIM 

HUMIGCB9 106 tip DNA PRI 12-APR-2001 

Human IgD germline chain J- delta region: C- delta CHI 
K01311 

K01311.1 GI: 184716 

C- region; germline; immunoglobulin heavy chain; 
immunoglobulin -delta, 
human. 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases l to 106} 

Rabbitts,T.K. . Forster,A. and Miletein,C.P. 

Human immunoglobulin heavy chain genes: evolutionary comparisons of 
C mu, C delta and C gamma genes and associated switch sequences 
Nucleic Acids Res. 9 (18) , 4509-4524 (1981) 
82059479 
6795593 

The deduced amino acid sequence is compared in [1] to the 
J/C-delta-i junction of human ER1 protein. The delta gene occurs 
on3,y 5 kb from the mu region. The authors [1] could not detect any 
switch- related sequences adjacent to the delta gene and state that 
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gene 
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CDS 



this implies that the rau/delta switch cannot occur by the class 
switch recombination method. They speculate that the entire 
vh- (c-mu) - (c-delta) region is transcribed into one nuclear 
precursor molecule which is spliced later. 

This is part of a multigene region containing the J- region, . switch 
region, C-rau- secreted, C-nrn^membrane, and C-delta genes. 

Location/Qualifiers 

1..106 

/organi sm- "Homo sapi ens » 
/db_xre f = " t axon : 96 0 6 ■ 
/map= n 14g32.33 M 
/eel l_type= " lymphocyte » 
/tissue_type= "placenta" 
/ 1 i s sue_type=» n 1 i ve r « 
/dev_etage» "foetus " 
/germline 

/tissue^lib^of Lawn et al." 
1..106 

/gene»"iGHD n 

<1..26 

/gene= n lGHD" 

/note»" intron delta J-C; G06-120-084'» 

<27 . . >106 

/gene= n IGHD" 

/note= M C- region CHI domain" 
/codon_start=3 

/product =" immunoglobul in del t a - chain » 
/protein_id= n AAB59423 . 1" 
/db_xref=»GI: 184735" 

/translation^ " PTKAPDVFPI ISGCRHPKDNSPVVLA 
38 c 24 g 20 t 



24 



BASE COUNT 
ORIGIN 

1 tgccacccca ggactctgtc ttccagcacc caccaaggct ccggatgtgt tccccatcat 
61 atcagggtgc agacacccaa aggataacag ccctgtggtc ctggca 

OMIM 8 °Lin1Sut 9ermline I ^...Cgi: 184770] PubMed, Protein, Related Sequences, Taxonomy, 
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HUMIGCH06 100 bp DNA PR3 08-NOV-1994 

Human germline IgD chain gene, C-region, secreted terminus 
K02880 

K02880.1 GI:184770 

C-region; germline; immunoglobulin heavy chain; 
immunoglobul in-del ta . 

Homo sapiens <individual_isolate Chronic lymphocytic leukemia (CLL) 
patient) dna. 
Homo sapiens 

Eukaryota; Metazoa,- Chordata; Craniata; Vertebrata; Euteleostomi- 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo 
1 (bases l to 100) 

White, M.B., Shen,A.L., Word,C.j., Tucker, P. w. and Blattner,F.R 

Human immunoglobulin b: genomic sequence of the delta heavy chain 

Science 228 (4700) , 733-737 (1985) 

85192522 

See segment 1. 

Location/Qualifiers 
1...3.00 

/ organ i sm* "Homo sapiens 0 

/isoiate=»"Chronic lymphocytic leukemia (CLL) patient" 

/db_xref = "taxon : 9606 » 

/map-"l4q32.33 B 

/cell_type= » lymphocyte ■ 

/germline 
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intron 



gene 



CDS 



BASE COUNT 
ORIGIN 



<1. . >100 
/gene^'TGHD- 
/note="G00-120-084" 
/number=5 

<1..24 

/genes^iGHD" 
/note«"G00-120-084 n 
/humber» 5 
25. .>53 
/gene=»IGHD n 

/note=" secreted terminus domain; G00^120-084" 
/number =S 

joiliCK02875.1:l. .495 , K0287S . 1 : 1 J .300, K02877. 1 : 1 . .300 
K02878.1tl. .500,K02879.1:1. . 500 , 1 . . 100, K02831 . 1 : 1 . . 200 , 
K02882.1:!. .52) 
/gene-'TGHD" 

join (K02875. 1:101. . 403 , K02876 . 1 : 101 . .202, 

K02877. 1:101. .172,K02378. 1:101. .424, K02879.' :101 424 
25. .53) ' ' 

/partial 

/genc= , 'IGHD n 

/note- "secreted form" 

/codon_start=3 

/pro<iucfc= "inrnrunoglobulin delta-chain" 
/prbtein_id=*"AAA52770 . 1" 
/db^xref«»GI : 495871 " 
/db_xre f = " GDB : GO 0 - 1 2 0 - 0 04 " 

/translations " PTKAPDVFPI ISGCRHPKDNS PWLACLI TG YHPTS VTVTWYMG 

TQSQPQRTFPEIQRRDSYYMTS3QLSTPLQQWRQGEYKCVVQHTASKSKKEIFRWPKS 
PKAQASSVPTAQPQAEGSLAKATTAPAT^^ 

PSHTQPLGVYLLTPAVQDLWIiRDKATFTCFWGSDLKDAHLTWEVAGKVPTGGVEEGL 
LERHSNGSQSQH5RLTLPRSLWNAGTSVTCTI^JH^ 

NLLAS SD PPE AASWLLCE VSGF3 PPNI IiLMWLEDQREVNTS GFAP ARP PPQPRSTT FW 
AWSVLRVPAPPSPQPATYTCVVSHEDSRTLLNASRSIiBVSYVTDHGPMK'' 
a 33 c 22 g 21 t 

8 kb after segment 5. 
1 gacacgccga ttttttgtta ttagatgtaa cagaccatgg ccccatgaaa tgatcccgga 
61 ccagatccgt ccgcacccgc cactcagcag ctctggccga 



About 
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HUMIGCH07 200 bp DNA PRi 08-NOV-1994 

Human germline IgD-chain gene, C- region, first domain of membrane 
terminus. 
K02881 

K02881.1 GI: 184771 

C- region; germline; immunoglobulin heavy chain,- 
immunoglobulin -delta. , 

Homo sapiens (individual_isolate Chronic lymphocytic leukemia (CLL) 
patient) DNA. ' 
■Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi • 
■Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 200) 

White, M.B., She^A.L,, Word,C,J., Tucker, P. W. and Blattner, F.R. 

Human immunoglobulin D: genomic sequence of the delta heavy chain 

Science 228 (4700) , 733-737 (1985) 

85192522 

See segment 1. 

Location/Qualifiers 
1,.200 

/organi s m» "Homo eapi ens " 
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exon 



intron 



gene 



CDS 



BASE COUNT 
ORIGIN 



/isolate^ "Chronic lymphocytic leukemia (CLL) patient" 
/dirxref = " taxon : 9606 ■ 
/map= ,, l4q32.33 ,, 
/ eel l_typs= "lymphocyte" 
/gerntline 
<1..24 
/gene="IGKD" 
/not e= "GOO- 120 - 084 « 
/numbers 5 
25-. 182 
/gene="lGHD'' 

/note=" first domain of membrane terminus; GOO-120-084; 
putative" 
/number=* 6 
183 . . >200 
/gene« 1, rGHD n 
/note«"G00-120-084 n 
/number=6 ' 

join;K02875.1:l. .495, K02876 . 1 : 1 . . 300, K02877 . 1 : 1. .300, 
K02878.1:!. .500,K02879.1 : 1. . 500 ,K02880 . 1 : 1 . . 100, 1 . 200 
K02882.1:!. .52) ■ ■ * 

/gene=»"IGHD" 

join:K02D75. 1:101. . 403, K02876. 1:101. .202, 
K02877.1;101. . 172 , K0287B . 1 : 101 . .424 K02879 . 1 : 101 . . 424 
25. . 182, K02882. 1:44. .52) 
/partial 
/gene="IGHD M 

/note= "membrane bound form" 
/codon_start=3 

/product ^'immunoglobulin delta-chain" 
/protein_id= ,, AAA52771.1» 
/db_xr e f= "GI : 4 9 5 8 72 » 
/ db__xre£= " GDB : GO 0 - 1 2 0 ^ 0 84 " 

/ 1 ransl at ion= " ptkapdvfpi isgcrhpkdns pwlacli tgyhptsvtvtwymg 

TQSQPQRrFPEIQRRDSYYMTSSQLSTPLQQWRQGEYKCWQHTASKSKICBIFRWPBS 
PKAQASSVPTAQPQAEGSLAKATTAPATTRNTGRGGEEaCKKEKEKEEQEE 
PSHTQPLGVYLLTPAVQDLWIJOKATFTCFWGSDLKDAHLTWEVAGKVPTGGVEEGL 
LEPJJSNGSOSQHSRLTLPPJSLVWAGTSVTCTLNHPSLPPQRIaMAliREPAAQAPVKLSL 
NLLAS SDPPEAASWLLCEVSGFS PPNI LLMWLEDQRBVNTSGFAPARPP ?Q PRSTT FW 
AWSVLRVPAPPSPQPATYT^/SHEDSRTLLKASRSLEVSYI^irPLIPQSKDENSDD 
YTTFDDVGSLWTTLSTFVALFILrLLYSGIVTFI KVK" 
37 a 72 C 49 g 42 t 

About 800 bp after segment 6. 
1 cgctcggccc ccgttcctcc ccagacctgg ccatgacccc cctgatccct cagagcaagg 
61 atgagaacag cgatgactac acgacctttg atgatgtggg cagcctgtgg accaccctgt 
121 ccacgtttgt ggccctcttc atcctcaccc tcctctacag cggcattgtc actttcatca 
1B1 aggtcagggg ageggecagg 



K028B2. Human germline 3gD. 
OMIM, LinkOut 
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HUMIGCHOB • 100 bp DNA pri 08-NOV-1994 

Human gerraline IgD-chain gene, C- region, second domain of membrane 
terminus. 
K02882 

K02882.1 GI: 184772 

C- region; germline,* immunoglobulin heavy chain; 
immunoglobulin- delta. 

Homo sapiens (individual isolate Chronic lymphocytic leukemia (CLL) 
patient) DNA. 
Homo sapiens 

Eukaryota; Metazoa; Chordata,- Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hbminidae; Homo. 
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REFERENCE 1 (bases 1 ' to 100) 

AUTHORS White, M.B., Shen,A.L., Word, C. J. , Tucker, P. w. and Blattner,F.R. 
TITLE Human immunoglobulin D: genomic sequence of the delta heavy chain 
JOURNAL Science 228 (4700), 733-737 (1985) 
MEDLINB B5192522 
COMMENT 3ee segment 1. 

FEATURES Location/Qualifiers 
source 1 . .100 

/organism=:"Homo sapiens" 

/isolate^ "Chronic lymphocytic leukemia (CLL) patient" 

/ db_xref = " t axon : 9 6 0 6 w 

/map= ! ,, 14q32.33" 

/ cell_type>= " lymphocyte" 

/germline 
intron <1 . . 43 

/gene= h IGHD n 

/note»"IgD-Mb" 

/number=6 
exon 44 . . >S2 

/gene^'IGHD" 

/note- "membrane- bound form (second domain of membrane 

terminus) ; GOO- 120- 084; putative" 

/number=7 

gene join (K02875 . 1 :1 . .435, JC02876. 1 ;1 . . 300,K02877 . 1 : 1 . .300, 

K02 878.1:1. .S00,KO2879.1:l. . 500 , K02880 . 1 : 1 . .100, 
K02881.1;l. .200,1. .52) 
/gene="IGHD n 

CDS join(K02875. 1:101. .403, K02876.1: 

K02877. 1:101. - 172 , K02878 . 1 : 101 . 
K0288i.l:25. .182,44 . .52) 
/partial 
/gene="IGHD" 

/note="merabrane bound form" 
/codon__start=3 

/products "immunoglobulin delta- chain" 

/pro t e i n_i d=* " AAA5 2 7 7 1 . l » 

/db_xref - " GI : 4 9587 2 B 

/ db__XT ef « " GDB : GO 0-120-0 84 " 

/ 1 rans 1 a t i on= " PTKAPDVFPIISGCRHPKDNSP WLACLITGYHPTS VTVTWYMG 
TQSQPQRTFPEIQRRDSYYMTSSQLSTPLQQWRQGEYKCWQHTASKSKKEIFRWPES 
PKAQASSVPTAQPOABGSIiAKATTAPATTRNTGSGGBEKKKEKEKEEQSERETKTPEC 
PSHTQPLt^YLLTPAVQDLWLRDKATFTCFWGSDbKDAHLTVIEVAGKVPTGGVEBGL 
LERHSNGSQSQHSRljTLPRSL)raAGTSVTC^^ 

NLIASSDPPEAASWLI^EVSGFSPPNILLMWLBDQRSVNTSGFAPARPPPQPRSTTFW 
AWS VLRVPAP PS PQ P AT YTCWSHEDSRTLLNASRS LEVS YLAMTPL I PQS KDENSDD 
YTTFDDVGSLWTTLS TFVALFI LTLL YSGI VTFIKVK " 
BASE COUNT 22 a 30 c 30 g IB t 

ORIGIN About 1.3 kb after segment 7. 

1 tcaggcttct agcccctgtc tgaccccagg ggctgtcttt caggtgaagt agccccagaa 
61 gagcaggacg ccctgtacct gcagagaagg gaagcagcct 



:lbl. .202, 

. 424, K02879. 1:101. .424, 



K02875. Human germline IgD. .. [gi: 184765] PubMed, Related Sequences, Taxonomy, OMIM, 
LinkOut 

LOCUS HUMIGCH01 495 bp DNA PRI C8-NOV-1994 

DEFINITION Human germline IgD chain gene, C-region, C-delta-1 domain. 
ACCESSION K02875 
VERSION K02875 .1 GI : 184765 

KEYWORDS C-region; germline; immunoglobulin heavy chain; 

immunoglobul in- delta . 
SOURCE Homo sapiens findividual__isolate Chronic lymphocytic leukemia (CLL) 

patient) DNA. 
ORGANISM Homo sapiens 



35/65 



WO 01/83529 



PCT/US01/13932 



FIGURE 7 



DD 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 

COMMENT 



FEATURES 

source 



gene 



xntron 
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CDS 



Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (bases 1 to 495) 

White, M.B., Shen,A.L., word,c..j.. Tucker, P. W. and Blattner, F.R. 
Human immunoglobulin D: genomic sequence of the delta heavy chain 
Science 228 (4700) , 733-737 (1985) 
85192522 

Sequence in computer readable form and draft entry for [1] were 
kindly provided by M.B.White, 06-AUG-1985. 

The C-delta and delta- s exon boundaries were located by comparing 
the translated sequences with known AA sequences [l] . 

Location/ Qualifiers 

1. .495 

/ org anism= "Homo sapiens" 

/isolate-" Chronic lymphocytic leukemia (CLL) patient" 

/db_xref - " taxon : 96 06 " 

/map=» ,, 14q32.33 M 

/ eel l__type= " lymphocyte % 

/gerraline 

join(l. .49S,K02876 .1:1. . 300 f K02877 . 1 : 1 . .300, 
K02878.1:!. .500 , K02879 . 1: 1 . . 500,K02880 .1 :1 . . 100, 
K02861.1:l. .200,K02882.1:1. .52) 
/gene^'IOUD" 
<1..100 
/gene= n IGHD" 
. /note="J-C intron; G00-120-084" 
join (101. . 403 , K02876 . I : 101 . . 202 , K02877 . 1 : 101 . .172, 
K02878. 1:101. - 424 , K02879 . 1 : 101 . .424 , K02880 . 1 : 25 . .53) 
/partial 
/gene= n IGHD" 
/note= D secreted form" 
/ codon_s t ar t = 3 

/product^" immunoglobulin delta-chain" 
/protein_id="AAA52770 .1" 
/db_xr e f = » GI : 4 9 5 8 71 " 
/db__xref « « GDB : GO 0-12 0 r 0 84 " 

/ transla t i on» " ptkapdvfpi i sgcrhpkdnspwiaclitgyhptsvtvtwymg 

TQSQPQRTFPEIQRRDSYYMTSSQLSTPIjQQWRQGEYKCWQHTASKSKKEIFRWPES 

PKAQASS VPTAQPQAEGS I^KATTAPATTRNTGRGGEEKKKEKEKEEQEERETKTPE C 

PSHTQPLGVYLLTPAVQDLWIiRDKATFTCFWGSDLI05AHLTV^AGKVPTGGVEEGL 

LERHSNGSQSQHSRLTTJPRSLWNAGTSVTCTI^PSLPPQRI^^ 

NLLASSDPPEAASWLLCEVSGFSPPNIIiLMWLEDQREVNTSGFAPARPPPQPRSTTFW 

AWSVLRVPAPPSPQPATYTCWSHEXISRTLLNASRSLEVSYVTDHGPMK" 

101. .403 

/gene» M IGHD n 

/note="C-delta-l domain; G00-120-084; putative" 
/number= 1 

join(101 . .403,K02876.1: 101. . 202, K02877. 1:101 . .172, 

K02878. 1:101. .424, K02879. 1:101. .424, K02881. 1:25- .182, 

K02882.1:44. .52) 

/partial 

/gene^niGHD" 

/note= "membrane bound form" 
/codcn_start»3 

/products "immunoglobulin delta -chain 11 
/protein_id="AAA52771 . l » 
/db_xref «"GI :495872" 
/db_xref« t, GDB:G00-120-084 n 

/translation- " PTKAPDVFPI ISGCPJIPKDNSPVvIAClJlTGYHPTSVTVTWYMG 
TQSQPQRTFPEIQRRDSYYMTS S QLSTPLQQWRQGB YKCWQHTAS KS KKE I FRWPES 
PKAQASSVPTAQPQAEGSLAKATTAPATTP^TGRGGEEKKKEKEKEEQEERETICFPEC 
PSHTQ PLGVYLLTPAVQDLWLRDKATFTCFWGSDLKD AHLTWEVAGKVPTGGVE EGL 
LERHSNGSQSQHSRLTLPRSLWtlAGTSVTCTIimPSLPPQRU^REPAAOAPVKLSL 
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NLLAS SDP PEAAS WLLCEVS GFSPPN I LLMWLEDQREVNTSGFAPARPP PQPRSTTFW 
AWSVLRVPAPPSPQPATYTCWSHEDSRTLLNASRSLE^/SYLAMTPLIPQSKDENSDD 
YTTFDDVGSLWTTLSTFVALFILTLLYSGI VTFI KVK" 
intron 4p4..>495 

/gene="-iGHD" 

/note="G00-120-084" 

/humber=l 

BASE COUNT 114 a 179 c 120 g 82 t 

ORIGIN 182 bp upstream of SphI site; chromosome 14q32.3. 

1 tttccctgcc tcccgtcacc ctgccgccag ggcctctgcc ctgccctgcc ccttgtcctc 
61 aggtttccag cctcagactc ccactgtgtc tgtcttccag cacccaccaa ggctccggat 
121 gtgttcccca tcatatcagg gtgcagacac ccaaaggata acagccctgt ggtcctggca 
181 tgcttgataa ctgggtacca cccaacgtcc gtgactgtca cctggtacat ggggacacag 
241 agccagcccc agagaacctt ccctgagata caaagacggg acagctacta catgacaagc 
301 agccagctct ccacccccct ccagcagtgg cgccaaggcg agtacaaatg cgtggtccag 
361 cacaccgcca gcaagagtaa gaaggagatc ttccgcbggc caggtaggtc gcaccggaga 
421 tcacccagaa gggcccccca ggacccccag caccttccac tcagggcctg accacaaaga 
481 cagaagcaag ggctg 



VHI. HUMAN IG EPSILON CHAIN C REGION - HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

>ep|P01354|EPC_HUMAN IG EPSILON CHAIN C REGION - Homo sapiens (Human) . 

1( ? 20 30 40 • 50 60 

I I I | | t 

ASTQSPSVFP LTRCCKNIPS NATSVTLGCL ATGYFFEPVM VTWDTGSLNG TTMTLPATTL 

70 80 90 , 100 110 126 

I • I i . | j | 

TLSGHYATIS* LLTVSGAWAK QMFTCRVAHT PSSTDWVDNK TFSVCSRDFT PPXVKILQSS 

130 140 150 ISO 170 160 

I I I ' | | I 

CDGGGHFPPT IQLLCLVSGY TPGTINITWL EDGQVMDVDL STASTTQEGE LASTQSELTL 

190 200 210 220 230 240 

I I I I I | 

SQKHWLSDRT YTCQVTYQGH TFEDSTKKCA DSNPRGVSAY LSRPSPFDLF IRKSPTITCL 

250 260 270 280 290 300 

1 i i I I *| 

WDLAPSXGT VNLTWSRASG KPVNHSTRKE EKQRN3TLTV TSTLPVGTRD WIBGETYQCR 

31< 3 320 330 340 350 360 

i I I | | | 

VTHPHLPRAL MRSTTKTSGP RAAPEVYAFA TPEWP3SRDK RTIiACXilQNF MPEDISVQWL 

37 | 380 390 400 410 420 

HNEVQLPDAR HSTTQPRKTK GSGFFVFSRL EVTRAEWEQK DEFICRAVHE AASPSQTVQR 

423 
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CODING SEQUENCE 

atcgactgga cctggatcct cttcttggbg gcagcagcca 
cagttggtgc agtctggggc tgaggtgagg aagcctgggg 
aacgcttctg gatacacctt catcgactcc tatatccact 
cacgggcttg agtgggtggg atggatcaac cctaacagtg 
agatttcagg gcagggtcac catgaccaga gacgcgtcct 
ctcagaagtc tgagatctga cgactcggcc gtgtttbact 
tggagtgatt" attataactt tgactactcg tacactttgg 
accgtcaccg tctcctcagc ctccacacag agcccatccg 
tgcaaaaaca ttccctccaa tgccacctcc gtgactctgg 
ttcccggagc. cggtgatggt gacctgggac acaggctccc 
ttaccagcca ccaccctcac gctctctggt cactatgcca 
tcggfiftgcgt gggccaagca gatgttcacc tgccgtgtgg 
gactgggtcg acaacaaaac cttcagcgtc tgctccaggg 
aacatcttac agtcgtcctg cgacggcggc gggcacttcc 
tgcctcgfcct ctgggtacac cccagggact atcaacatca 
gtcatggacg tggacttgtc caccgcctct accacgcagg 
caaagcgagc tcaccctcag ccagaagcac tggctgtcag 
gtcacctatc aaggtcacac ctttgaggac agcaccaaga 
agaggggtga gcgcctacct aagccggccc agcccgttcg 
cccacgatca cctgtctggt ggtggacctg gcacccagca 
tgctcccggg ccagtgggaa gcctgtgaac cactccacca 
aatggcacgt taaccgtcac gtccaccctg ccggtgggca 
gacacctacc agtgcagggt gacccacccc caccbgccca 
accaagacca gcggcccgcg bgctgccccg gaagtctatg 
ccggggagcc gggacaagcg caccctcgcc tgcctgatcc 
atctcggtgc agtggctgca caacgaggtg cagctcccgg 
cagccccgca agaccaaggg cbccggcttc ttcgtcbtca 
gccgaatggg agcagaaaga tgagttcatc tgccgtgcag 
tcacagaccg tccagcgagc ggtgtctgta aatcccggta 

QenBank 

L00022. Human ig active he. . . 



cgcgagtcca 

catcagtgag 

ggatacgaca 

gtggcacaaa 

tcagtacagc 

gtgcgaaaag 

acgtctgggg 

tcbtcccctt 

gctgcctggc 

tcaacgggac 

ccatcagctt 

cacacactcc 

acttcacccc 

ccccgaccat 

ccbggcbgga 

agggtgagct 

accgcaccta 

agtgtgcaga 

accbgttcab 

aggggaccgb 

gaaaggagga 

cccgagacbg 

gggccctcat 

cgtbbgcgac 

agaacttcat 

acgcccggca 

gccgccbgga 

tccatgaggc 

aatga 



ctcccagacg 

ggtctcctgc 

ggcccctggg 

ctatgctccg 

ctacatggac 

tgaccctttt 

ccaagggacc 

gacccgctgc 

cacgggctac 

aactatgacc 

gctgaccgtc 

atcgtccaca 

gcccaccgtg 

ccagctcctg 

ggacgggcag 

ggcctccaca 

cacctgccag 

ttccaacccg 

ccgcaagtcg 

gaacctgacc 

gaagcagcgc 

gatcgagggg 

gcggtccacg 

gccggagbgg 

gcctgaggac 

cagcacgacg 

ggtgaccagg 

agcgagcccc 



LOCUS 
DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 

SOURCE 



ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 

TITLE 



60 
120 
180 
240 
300 
360 
420 
4 80 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1725 



HUMIOHAE2 1920 bp DNA PRI 22-DEC-1994 

Human Ig active heavy chain epsilon- 1 gene, constant region. 
L00022 J00227 V00555 
L00022.1 GI: 185035 

C-region; epsilon- immunoglobulin,- immunoglobulin heavy chain; 
processed gene. 

Human myeloma cell line 266B1 DNA and cDNA to mRKA, clones 
H-Ig-epsilon-ll, lambda -epsilon- 1.2, pJJ71 , • pGET2 and K85/A12 (see 
comment) . 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi • 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae,- Homo. 

1 (bases l to 1920) 
Flanagan,j.G. and Rabbitts,T.H. 

The sequence of a human immunoglobulin epsilon heavy chain constant 
region gene, and evidence for zhree non-allelic genes 
EMBO J . 1 (5), 655-660 (1982) 
84236029 

2 (bases 528 to 736; 1044 to 1138) 
Nishida,Y., Miki,T., Hisajima,H. and Honjo,T. 

Cloning of human immunoglobulin epsilon chain genes : evidence for 
multiple C epsilon genes 

Proc. Natl. Acad. Sci. U.S.A. 79 (12), 3833-3837 (1982) 
82247945 

3 (bases 1 to 1920) 

Kenteh, J.H., Molgaard,H.V. , Houghton, M., Derbyshire ( R . B Viney,J., 
Bell,L.O. and Gould,H.J. 

Cloning and sequence determination of the gene for the human 
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JOURNAL 
MEDLINE 
REFERENCE 
AUTHORS 

TITLE 

" JOURNAL 
MEDLINE 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
COMMENT 



FEATURES 

source 



conflict 



intron 



conflict 



conflict 



immunoglobulin epsilon chain expressed in a myeloma cell line 
Proc. Natl. Acad. Sci. U.S.A. 79 (21), 6661-6665 (1982) 
83065234 

4 (bases 98 to 1884) 

Seno,M. , KuroJcawa,T. , Ono,Y., Onda,H., Sasada.R., Igarashi,K_, 
Xlkuchi , M. , Sugino,Y., Niehida,Y. and Honjo f T. 

Molecular cloning and nucleotide sequencing of human immunoglobulin 
epsilon chain cDNA 

Nucleic Acids Res. 11 (3)* 719-726 (1983) 
83168897 

5 (bases 691 to 807/ 1571 to 1818; 1860 to 1885) 
Liu,£\T., Albrandt, K.A. , Bry,C.G. and Ishizaka,T. 

Expression of a biologically active fragment of human IgE epsilon 
chain in Escherichia coli 

Proc. Natl. Acad. Sci. U.S.A. 81 (17), 5369-5373 (1984) - 
84298140 

[23 and [1] report the isolation of two other epsilon genes, 
epsilon-2 and epsilon-3. The authors in [2] claim that epsilon-3 is 
a pseudogene. Compared in [4J with the germline C- region sequence 
by Max, et al (Cell 29, 691-699 (1982)), and there are three 
nucleotide differences. The deduced amino acid sequence in [4] 
differs somewhat from- the published C- region sequence. [5] produced 
expression of IgE in E. coli by insertion into expression vector 
pUC7 . 

Complete source information: 

Human myeloma cell line 266B1 DNA [2] , [1] , [5] and cDNA to mRNA {33 , 
[4], clones H-Ig-epsilon-11 [2] , lambda-epsilon-l . 2 [1], pjJ71 [3], 
p6ET2 14] and K85/A12 [5] . 

Location/Qualifiers 

2. .2920 

/organisms*' Homo sapiens" ■ t 

/db_xre f = " taxon : 9 6 0 6 " 
/map="14q32.33" 
<1. .1886 

/note= M epsilon-l mRNA" . 
<1 . .97 

/gene=»iGKE» 

/not e«" epsilon-l intron J-C" 

98 . .406 

/gene= M IGHE" 

/note="Ig heavy chain epsilon-l (CHI domain); GOO-119^3-5" 
407.. 613 
/genesTGHE" 

/ho te=" epsilon-l intron A" 
614. .934 
/gene^'IGHE" 

/note= M Ig heavy chain epsilon-l (CH2 domain)" 
735 

/gene^-IGHE" 
/citations [3] 
/replaces"" 
935.. 1020 
/gene^'IdlE" 

/note^ 1 ' epsilon-l intron B" 
1021.. 1344 
/gene^TGHE" 

/note= "Ig heavy chain epsilon-l (CH3 domain) n 

1124 

/gene=*IGHE" 
/citation- [3] 
/replace-" *' 
1337 

/gene* n I SHE" 



prim_transcript 
intron 



intron 
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HH 



intron 
exan 

conflict 
conflict 

conflict 

9ene 
CDS 



BASS COUNT 387 a 658 C 

origin About 3 k£> after '; 

l ggatccctgc cacggggtcc 
61 goctgaggct ggcactgact 
121 ttccccttga cccgctgctg 
181 tgcctggcca cgggctactt 
241 aacgggacaa ctatgacctt 
301 atcagcttgc tgaccgtctc 
361 cacactccat cgtccacaga 
421 agctcagaga ecacagttcc 
481 tgagggggtg ggtgggctca 
541 acgtgggggc aagaggaggg 
601 agctctcccc cagtctgctc 
661 tcctgcgacg gcggcgggca 
721 tacaccccag ggactatcaa 
781 ttgtccaccg cctctaccac 
841 ctcagccaga agcactggct 
901 cacacctttg aggacagcac 
961 gccacggagg ccagagaaga 
1021 attccaaccc gagaggggtg 
1081 tccgcaagtc gcccacgatc 
1141 tgaacctgac ctggtcccgg 
1201 agaagcagcg caatggcacg 



/citation- [3] 
/replace^ » « 
1345. .1427 
/gene= n lGHE" 

/not e= w eps lion -l intron C" 
1428 . . >1759 

/nbtee"ig heavy chain epsilon-1 (CH4 domain) * 
1444 . .1445 
/gene= tt IGHE» 
/citations [3] 
/replaces"" 

1612 

/gene= l, IGHE n 
/citation^ (33 
/replaces 
1785 

/citation= [3] 
/replace=" " 

join(L00021. 1:57. .495,1. .1758) 
/genes^IGHE" 
join(L0002l.l:57. 
1428. .1759) 
/partial 
/gene="IGHE n 

/note="Ig heavy chain epsilon-1 (V-D-J region)" 
/codon_3tart«l . 
/protein_icU= n AAB59424 . 1" 
/db_xref = »GI : 3 8 680 7 " 
/db_xre£=»GDB:G00^119-335 M 

/translation^ "raWTWILPLVAAATRVHSQTQLVQSQAEVRKPGAS VRVSCKASG 
YTF1D S YI HWI RQAPGHGL^/GWXt^NSGGlOTAPRPQGRVTMTRDAS FSTAYMDLR 
SI^SDDSAVTYCAKSDPFWSDYYNFDYSYTLDWGQGr^^ 

CKN I P SNATSVTIXUCIjATGYF P3 PVMVTWDTGS LNGTIMTLP ATTLTLSGH YATI S LL 
TVSGAWAKQMFTCRVAHTPSSrDWvI)NKTFSVCSRD?rPPTVKILQSSCDGGGHFPPT 
IQLLCLVSGYTPGTINITWLEDGQVMDVDLSTASTTQEGELASTQSELTLSQKHWLSD 
RTYTCQVTYQGHTFEDSTKKCADSNPRGVSAYLSRPSPFDLFIRKSPTITCLWDLAP 
SKGT\^TWSRASGKPViraSTRKEEKQRNGTIjTVT^ 

HLPRALiMRSTTICrS GPRAAPEVYAF AT PEWPGS RDKRTLACL 1 QNFMPED I S VQWLHN 
E VQLPDARHSTTQPRKTKGSGFFVFSRLBVTRAEWEQKDEFI CRAVHEAAS PSQTVQR 
AVSVNPGK" 



J . . 495 , 98 . .406, 614 . . 934 , 1021 . . 1344, 



576 g 299 t 
l bp upstream of BamHi site, 
ccagctcccc catccaggcc ccccaggctg atgggcgctg 
aggttctgtc ctcacagcct ccacacagag cccatccgtc 
caaaaacatt ccctccaatg ccacctccgt gactctgggc 
cccggagccg gtgatggtga cctgggacac aggctccctc 
accagccacc accctcacgc tctctggtca ctatgccacc 
gggtgcgtgg gccaagcaga tgttcacctg ccgtgtggca 
ctgggtcgac aacaaaacct tcagcggtaa gagagggcca 
caggagtgcc aggctgaggg ctggcagagt gggcaggggt 
aacgtgggaa cacccagcat gcctggggac ccgggccagg 
cacacagagc tcagagaggc caacaaccct catgaccacc 
cagggacttc accccgccca ccgtgaagat cttacagtcg 
cttccccccg accatccagc tccfcgtgcct cgtctctggg 
catcacctgg ctggaggacg ggcaggtcat ggacgtggac 
gcaggagggt gagctggcct ccacacaaag cgagctcacc 
gtcagaccgc acctacacct gccaggtcac ctatcaaggt 
caagaagtgt gcaggtacgt tcccacctgc cctggtggcc 
ggggcgggtg ggcctcacac agccctccgg tgtaccacag 
agcgcctacc taagccggcc cagcccgttc gacctgtfcca 
acctgtctgg tggtggacct ggcacccagc aaggggaccg 
gccagtggga agcctgtgaa ccactccacc agaaaggagg 
ttaaccgtca cgtccaccct gccggtgggc acccgagact 
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1261 ggatcgaggg ggagacctac cagtgcaggg tgacccaccc ecacctgccc agggccctca 
1321 tgcggtccac gaccaagacc agcggtgagc catgggcagg ccggggtcgt gggggaaggg 
1381 agggagcgag tgagcggggc ccgggctgac cccacgtctg gccacaggcc cgcgtgctgc 
1441 cccggaagtc tatgcgtttg cgacgccgga gtggccgggg agccgggaea agcgcaccct 
1501 cgcctgcctg atccagaact .tcaegccfcga ggacatctcg -gcgcagtggc tgcacaacga 
1561 ggtgcagctc ccggacgccc ggcacagcac gacgcagccc cgcaagacca agggctccgg 
1621 cttcttcgtc ttcagccgcc tggaggtgac cagggccgaa tgggagcaga aagatgagtt 
1681 catctgccgt gcagtccatg aggcagcgag cccctcacag accgtccagc gagcggtgtc 
1741 tgtaaatccc ggtaaatgac gtactcctgc ctccctccct cccagggctc catccagctg 
1801 tgcagtgggg aggactggcc agaccttctg tccactgttg caatgacccc aggaagctac 
1861 ccccaataaa ctgtgcctgc tcagagcc^c agtacaccca ttcttgggag cgggcagggc 

IX HUMAN IG MU CHAIN C REGION -HOMO SAPIENS (HUMAN). 

AMINO ACID SEQUENCE 

>sp|P0187l |MUC_HUMAU IG MU CHAIN C REGION - Homo sapiens (Human) . 

10 20 30 40 50 60 

GSASAPTLFP LVSCSNSPSD TSSVAVGCLA QDFLPDSITF SWKYKNNSDI SSTRGFPSVL 

70 80 90 100 1X0 120 

I I I | | | 

RGGKYAATSQ VLLPSKDVMQ GTDEHWCKV QHPNGNKEKN VPLPVIAELP PKVSVFVPPR 

130 140 150 160 170 180 

i I I | | I 

DG7FGNPRSK SKLICQATGF SPRQIQVSWL REGKQVGSGV TTDQVQAEAK ESGPTTYKVT 

• 1 9° 200 210 220 230 240 

I I | | | I 

STLTIKESDW LSQSMFTCRV DHRGLTFQQN ASSMCVPDQD TAIRVFAIPP SFASIFLTKS 

250 260 270 280 290 300 

1 I I I | | 

TKLTCLVTDL TTYDSVTISW TRQNGEAVKT HTNISESHPN ATFSAVGEAS ICEDDWNSG3 

310 320 330 340 350 360 

I I I | | | 

RFTCTVTHTD IiPSPLKQTIS RPKGVALHRP DVYLLPPARE QLNIiRESATI TCLVTGFSPA 



LPNRVTERTV DKSTGKPTLY NVSLVMSDTA GTCY 
CODING SEQUENCE 

atggactgga cctggaggtt cctctttgtg gtggcagcag ctacaggtgt ccagtcccag 60 

gtgcagctgg tgcagtctgg ggctgaggtg aagaagcctg ggtcctcggt gaaggtctcc 120 

tgcaaggctt ctggaggcac cttcagcagc tatgctatca gctgggtgcg acaggcccct 180 

ggacaagggc ctgagtggat gggagggatc atccctatct ttggtacagc aaactacgca 240 

cagaagttcc agggcagagt cacgattacc gcggacgaat ccacgagcac agcctacatg 300 

gagctgagca gcctgagatc tgaggacacg gccgtgtatt actgtgcgaa aaccgggatc 360 

ctggggccgt atagcagtgg ctggtacccg aactcggact acfcactacta cggtatggac 420 

gtctggggcc aagggaccac ggtcaccgtc tcctcaggga gtgcatccgc cccaaccctt 480 

ttccccctcg tctcctgtga gaattccccg tcggatacga gcagcgtggc cgttggctgc 540 

ctcgcacagg acttccttcc cgactccatc actttctcct ggaaatacaa gaacaactct 600 

gacatcagca gcacccgggg cttcccatca gtcctgagag ggggcaagta cgcagccacc 660 
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tcacaggtgc 
aaagtccagc 
ctgcctccca 
agcaagtcca 
tggctgcgcg 
gccaaagagt 
gactggctca 
cagaatgcgt 
cccccatcct 
gacctgacca 
aaaacccaca 
gccagcatct 
acagacctgc 
aggcccgatg 
accatcacgt 

asaeggcagc 

gccccaggcc 
ggggagacct 
accgtggaca 
tgggccaccfg 
gtcaccttgt 



tgctgccttc 
accccaacgg 
aagtgagcgt 
agctcatctg 
aggggaagca 
ctgggcccac 
gccagagcat 
cctccatgtg 
ttgccagcat 
cctatgacag 
ccaacatctc 
gcgaggatga 
cctcgccact 
tctacttgct 
gcctggtgac 
ccttgtcccc 
ggtacttcgc 
acacctgcgt 
agtccaccga 
cctccacctt 
tcaaggtgaa 



caaggacgtc 
caacaaagaa 
cttcgtccca 
ccaggccacg 

ggtggggtct 

gacctacaag 
gttcacctgc 
tgtccccgat 
ctfccctcacc 
cgtgaccatc 
cgagagccac 
ctggaattcc 
gaagcagacc 
gccaccagcc 
gggcttctct 
ggagaagtat 
ccacagcatc 
ggtggcccat 
9999gaggtg 
catcgtcctc 
atga 



atgcagggca 
aagaacgtgc 
ccccgcgacg 
ggtttcagtc 
ggcgtcacca 
gtgaccagca 
cgcgtggatc 
caagacacag 
aagtccacca 
tcctggacca 
cccaatgcca 
ggggagaggt 
atctcccggc 
cgggagcagc 
cccgcggacg 
gtgaccagcg 
ctgaccgtgt 
gaggccctgc 
agcgccgacg 
ttcctcctga 



cagacgaaca 
ctcttccagt 
gcttcttcgg 
cccggcagat 
cggaccaggt 
cactgaccat 
acaggggcct 
ccatccgggt 
agttgacctg 
gccagaatgg 
ctttcagogc 
tcacgtgcac 
ccaagggggt 
tgaacctgcg 
tcttcgtgca 
ccccaatgcc 
ccgaagagga 
ccaacagggt 
aggagggctt 
gcctcttcta 



cgtggtgtgc 
gattgctgag 
caacccccgc 
tcaggcgt cc 
gcaggctgag 
caaagagagc 
gaccttccag 
cttcgccatc 
cctggtcaca 
cgaagctgtg 
cgtgggtgag 
cgtgacccac 
ggccctgcac 
ggagtcggcc 
gtggatgcag 
tgagccccag 
atggaacacg 
caccgagagg 
tgagaacctg 
cagtaccacc 



720 
780 
340 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1580 
1740 
1300 
1860 
1384 



GenBank 

XI 7115. Human mRNA for IgM 
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HSIGM201 2213 bp mRNA pri 

Human mRNA for IgM heavy chain complete sequence . 
X17115 ' 
XI 71 15.1 GI: 33450 

lg heavy chain; IgM gene; IgM heavy chain; transmembrane protein, 
human. 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; vertebrata; Euteleoetomi- 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo 

1 (bases 1 to 2213) 
Friedlander , R . M. 
Direct Submission 

Submitted (03-NOV-1989) Friedlander R. M., Harvard Medical School 
Howard Hughes Medical Institute, Department of Genetics, 25 
Shattuck St, Boston, MA 02115," USA 

2 (bases 1 to 2213) 

Friedlander ( R.M. , Nussenaweig,M.C. and Lader,P. 

Complete nucleotide sequence of the membrane form of the human IqM 
heavy chain 3 
Nucleic Acids Res. 18 (14), 4278 (1990) 
90332450 

3 (bases 1 to 2213) 
Kristensen,T. , Lopez,R. and Pryd2,H. 

An estimate of the sequencing error frequency in the DNA seouence 
databases 

DNA Seq. 2 (6) , 343-346 (1992) 
93075997 

Erratum: { [published erratum appears in DNA Seq 1993 ;3 (5) :337]] 
For genomic sequence see <K01306>, <X14939> and 
<X14940>. The author reports various conflicts with these 
sequences. Data kindly reviewed (3Q-MAY-1990) by Friedlander R.M. 

Location/Qualifiers 

1. .2213 

/organism^ "Homo sapiens" 
/db__xref = n taxon : 9 6 0 6 " 
/clonerr-201-203'' 
/cell_line=" lymphoma 201" 
/cell_type= ,, B f * 
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FIGURE 7 KK 



/ 1 i s sue_type= " lymphoid " 
misc_f eature 1..39 

/notes "putative VECTOR sequence Bluescript SKP+" 

/citation=[3) 
CDS 73. .1956 

/not e=»" precursor (AA -15 to 612)" 

/codon_start«l 

/protein_id=»CAA34971.1» 

/db_xre£= M Gt: 33451" 

/db_xref= n SWISS-PR0T:P01871 n 

/db_xref= "SWISS- PRO! 1 : P20769 » 

/ 1 r ansl at ion= » ^WTW.^LFWAAATGVQSQVQLVQSG^^KKPGSSVKVSCKAS 

GGTFSSYAISWVRQAPGQGLEWMGGIIPIFGTANYAQKFQGRVTITADESTSTAYMEL 

SSLRSEDTAVYYCAKTGILGPYSSGWPNSDYYYYGM^GQGTTVTV 

FPLVS CENS PSDTSSVAVGCLAQDFLPDSITFSWKYKNNSDI S STRGFPSVLRGGKYA 

ATSQVLLPSKDVMQQTDEHWCKVQHPNGNKBKNVPLFVIAELPPKVSVFVT 

GNPRSKSKLICQATGFSPRQIQVSWLREGKQVGSGVTllDQVQAEAKESGPTTYia'TST 

LT I KE SDWLSQSMFTCRVDHRGLT FQQNASSMCVPDQDTAI RVFAI PPSFASI FLTXS 

TKLTCLVTDLTTYDSVTI S WTRQNGEAVKTHTUI 9ESHPNATFS AVGEAS I CEDDWNS 

GERFTCTVTHTDLPSPItKQTISRFKGVALHRPDVYLLPPAREQLNLRESATITCL 

FS PAD VFVQWMQRGQPLS P3KYVTSAPMPEPQAPGRYFAHS I LTVS BEE WNTGE T YTC 

V^AHEALPhJRVTERTVDKSTEGE^SADESGFENLWATASTFIVLFLLSLFYSTTVTLF 

KVK" 

sig_peptide 73 . . 117 

mat_peptide lie . . 1953 

/products "IgPI heavy chain (AA 1 to 612) " 
polyA_site 2213 

/note= n polyA site" 
BASE COUNT 462 a 708 c 629 g 414 t 

ORIGIN 

1 gctctagaac tagtggatcc cccgggctgc aggaattctc taaagaagcc cctgggagca 
61 cagctcatca ccatggactg gacctggagg ttcctctttg tggtggcagc agctacaggt 
121 gtccagtccc aggtgcagct ggtgcagtct ggggctgagg tgaagaagcc tgggtcctcg 
1B1 gtgaaggtct cctgcaaggc ttctggaggc accttcagca gctatgctat cagctgggtg 
241 cgacaggccc ctggacaagg gcttgagtgg atgggaggga tcatccctat ctttggtaca 
3 01 gcaaactacg cacagaagtt ccagggcaga gtcacgatta ccgcggacga atccacgagc 
361 acagcctaca tggagctgag cagcctgaga tctgaggaca cggccgtgta ttactgtgcg 
421 aaaaccggga tcctggggcc gtatagcagt ggctggtacc cgaactcgga ctactactac 
431 tacggtatgg acgtctgggg ccaagggacc acggtcaccg tctcctcagg gagtgcatcc 
541 gccccaaccc ttttccccct cgtctcctgt gagaattccc cgtcggatac gagcagcgtg 
601 gccgttggct gcctcgcaca ggacttcctt cccgactcca tcactttcte ctggaaatac 
651 aagaacaact ctgacatcag cagcacccgg ggcttcccat cagtcctgag agggggcaag 
721 tacgcagcca cctcacaggt gctgctgcct tccaaggacg tcatgcaggg cacagacgaa 
731 cacgtggtgt gcaaagtcca gcaccccaac.ggcaacaaag aaaagaacgt gcctcttcca 
841 gtgattgctg agctgcctcc caaagtgagc gtcttcgtcc caccccgcga cggcttcttc 
901 ggcaaccccc gcagcaagtc caagctcatc tgccaggcca cgggtttcag tccccggcag 
961 attcaggtgt cctggctgcg cgaggggaag caggtggggt ctggcgtcac cacggaccag 
1021 gtgcaggctg aggccaaaga gtctgggccc acgacctaca aggtgaccag cacactgacc 
1031 atcaaagaga gcgactggct cagccagagc atgttcacct gccgcgtgga tcacaggggc 
1141 ctgaccttcc agcagaatgc gtcctccatg tgtgtccccg atcaagacac agccatccgg 
1201 gtcttcgcca tccccccatc ctttgccagc atcttcctca ccaagtccac caagttgacc 
1261 tgcctggtca cagacctgac cacctatgac agcgtgacca tctcctggac ccgccagaat 
1321 ggcgaagctg tgaaaaccca caccaacatc tccgagagcc accccaatgc cactttcagc 
1381 gccgtgggtg aggccagcat ctgcgaggat gactggaatt ccggggagag gttcacgtgc 
1441 accgtgaccc acacagacct gccctcgcca ctgaagcaga ccatctcccg gcccaagggg 
1501 gtggccctgc acaggcccga tgtctacttg ctgccaccag cccgggagca gctgaacctg 
1561 cgggagtcgg ccaccatcac gtgcctggtg acgggcttct ctcccgcgga cgtcttcgtg 
1621 cagtggatgc agagggggca gcccfctgtcc ccggagaagt atgtgaccag cgccccaatg 
1681 cctgagcccc aggccccagg ccggtacttc gcccacagca tcctgaccgt gtccgaagag 
1741 gaatggaaca cgggggagac ctacacctgc gtggtggccc atgaggccct gcccaacagg 
1801 gtcaccgaga ggaccgtgga caagtccacc gagggggagg tgagcgccga cgaggagggc 
1861 tttgagaacc tgtgggccac cgcctccacc ttcatcgtcc tcttcctcct gagcctcttc 
1921 tacagtacca ccgtcacctt gttcaaggtg aaatgatccc aacagaagaa catcggagac 
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FIGURE 7 LL 



1981 cagagagagg aactcaaagg ggcgctgcct ccgggtctgg ggtcctggcc tgcgtggcct 

2041 gttggcacgt gtttctcttc ccgcccggcc tccagttgtg tgctctcaca caggcttcct 

21D1 tctcgaccgg caggggctgg ctggcttgca ggccacgagg tgggctctac cccacactgc 

21S1 tttgctgtgt atacccttgt tgccctgaaa taaacatgca cattttatcc atg 
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Hindlll sail Clal 

1 GAACTCGAGCAGCTGAAGCTTGCATGCCTGCAGGTCGACGGTATCGATAAGGATCC 
8 1 ATCraCAAATTKCTTTTOT 

161 TMTMGcrcre^ 

241 TGTACGGCTAAGAGCGAATTTGGCtiTGT^^^ 

321 ATCGACCATGTACGTAAGCGCTTACGTTOTTGGTGGAC 

401 ATGGATCATTAATTTCCACCTTCACCTACGATGGG^^ 




« w w^^-x x w>j iiiAXAi 1AUWUTAAUAGCGAATTTGGCCTGTAGGATCCGCGAGCT 

721 GGTCAATCCCATTGCTTTTGAAGCAGCTCAACATTGATCTCTOT 
801 GACGTAAGTATCCGAGTCAGTTTTTATTTTTC^^ 
881 TGAATTTCGCGGGTATTCTGTTTCTATTCCAACTCT 
961 eACGraVAGCCTCOCTA^^ 
L041 GCCATTTCGCCTTTTCAGAAATGGATAAATAGCCTTGCTTCCTATTATATOT 

Clal 

L121 ATCTGAATTTCATAACCAATCTCGATACACCAAATC^^ 



Spel 

L198 AAACGTTTTTTQTCTCTTCTTtCATTGTGATO 

LysProPheLeuSerLeuIieuSerLeuSerLeiUieToLeuPheThrSerThrSerLeu 
L2S5 GCACAGACATCTGTGTCCCCCTCAAAAGTCATCCTGCCCCGGGGAGGCTCCGTG 

AlaGlnThrSerValSerProSerLysVallleLeuProArgGlyGlySerVal 
L309 CTGGTGACATGGAGCACCTCCTGTGACCAGCGCAAGTTGTTGGGCATAGAGACCCCGTTG 

LeuValThrCysSerThrSerCysAspGlnProLysLeuLeuGlylleGluThrProLeu 
L3 69 CCTAAAAAGGAGTTGCTCCTGCCTGGGAACAACCGGAAGGTGTATGAACTGAGGAATGTG 

ProLysLysGluLeuLeuLeuProGlyAsnAsnArgLysValTyrGluLeuSerAsnVal 
L42 9 CAAGAAGATAGCCAACCAATGTGCTATTCAAACTGCCCTGATC 

GlnGluAspSerGlnProMetCysTyrSerAsnCysProAspGlyGlnSerThrAlaLys 
L4.8 9 ACCTTCCTCACCGTGTACTGGACTCCAGAACGGGTGGAACTGGCACCCCTCCCCTCTTGG 

ThrPheLeuThrValTyrTrpThrProGluArgValGluLeuAlaProLeuProSerTrp- 
L54 9 CAGCCAGtGGGC^GAACCTTAGGGTACGCTGCCAGGTGGAGGGTGGGGCACCCCGGGCC 

GlnProValGlyLysAsnLeuThrLeuArgCysGlnValGluGlyGlyAlaProArgAla 
1609 AACCTCACCGTGGTGCTGPTCCGTGGGGAGAAGGAGCTGAAACGGGAGCCAGCTGTGGGG 

AsnLeuThrValValLeuLeiiArgGlyGluLysGluLeuLysArgGluProAlaValGly 
L669 GAGCCCGCTGAGGTCACGACCACGGTGCTGGTGAGGAGAGATCACCATGGAGCCAATTTC 

GluProAlaGluValThrThrThrValLeuValArgArgAspHisHisGlyAlaAsnPhe 
L72 9 TCGTGCGGCACTGAACTGGACCTGCGGCCCCAAGGGCTGGAGCTGTTTGAGAACACCTCG 

SerCysArgThrGluLeuAspLeuArgProGlnGlyLeuGluLeuPIieGlijAsnXhrSer 
L7 8 9 GCCCCCTACCAGCTCCAGACCTTTGTCCTGCCAGCGACTCCCCCACAACTTGTCAGCCCC 

AlaProTyrGlnLeuGlnThrPheValLeuProAlaThrProProGlnLeuValSerPro 
L849 CGGGTCCTAGAGGTGGACACGCAGGGGACCGTGGTCTGTTCCCXGGAGGGGCTGTTCCCA 

ArgValLeuGluValAspThrGlnGlyThrValValCysSerLeuAspGlyLeuPhePro 
L90 9 GTCTCGGAGGCCCAGGTCCACCTGGCACTGGGGGACGAGAGGTTGAACCCCACAGTCACC 

ValSerGlxiAlaGlnValHisLeuAlaLeuGlyAspGlnArgLeuAsnProThrValThr 
1969 TATGGCAACGACTCCXTCTCGGCCAAGGCCTCAGTCAGTGTGACCGCAGAGGACGAGGGC 

TyrGlyAsnAspSerPheSerAlaLysAlaSerValSerValThrAlaGluAspGluGly 
2029 ACCCAGCGGCTGACGTGTGCAGTAATACTGGGGAACC AGAGCCAGGAGACACTGCAGACA 

ThrGlnArgLeuThrCysAlaVallleLeuGlyAsnGlriSerGlnGluThrLeuGlnThr 
2089 GTGACCATCTACAGCTTTCCGGCGCCCAACGTGATTCTGACGAAGCCAGAGGTCTCAGAA 

ValThrlleTyrSerPheProAlaProAsnVallleLeuThrLysProGluValSerGlu 
2149 GGGACCGAGGTGACAGTGAAGTGTGAGGCCCACCCTAGAGCCAAGGTGACGCTGAATGGG 

GlyThrGluValThrValLysCysGluAlaHisProArgAlaLysValThrLeuAsnGly 
2209 GTTCCAGCCCAGCCACTGGGCCCGAGGGCCCAGCTCCTGCTGAAGGCCACCCCAGAGGAC 
ValProAlaGlnProLeuGlyProArgAlaGlnLeuLeuLeuLysAlaThrProGIuAsp 
2269 AACGGGCGCAGCTTCTCCTGCTCTGCAACCCTGGAGGTGGCCGGCCAGCTTATACACAAG 
AsnGlyArgSerPheSerGysSerAlaThrLeuGluValAlaGlyGlnLeuIleHisLys 
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2329 AACCAGACCCGGGAGCTTGGTGTCCTGTATGGCCCGCGACTGGACGAGAGGGATTGTCCG 
AsnGlnThrArgGluLeuArgValLeuly^ 

2389 GGAAACTGGACGTGGCCAGAAAATTCCCAGCAGACTCCAATGTGCCAGGCTTGGGGGAAC 

GlyAsnTrpThrTrpProGluAsnSerGlnGlnThrProMetCysGlnAlaTrpGlvAsn 
Sad 

2449 CC^TTGCCCGAGCTCAAGTGTCTAAAGGATGGCACTTTCCCACTGCCCATCGGGGAATCA 
ProLeuProGluLeuLysCysIieuLysAspGlyThrPheProLeuProIleGlyGluSer 
2509 GTGACTGTCACTOGAGATCTTGAGGGCACCTACC 

ValThrValThrArgAspLeuGluGlyThrTyrLeuCysArgAlaArgSerThrGlnGly 

Spel Saci 

2569 GAGGTCACCCGCGAGGTGACCGTGAATGTGACTAGTGGGAGGTCAGCATCCCCG 
GluValThrArgGluValThrValAsnValThrSerGlySerSerAlaSerPro 
2623 ACCAGCCCCAAGGTCTTCCCGCTGAGCCTCGACAGCACCCCCCAAGATGGGAACGTGGTC 
ThrSerProLysValPheProLeuSerLeuAspSerThrProGlnAspGlyAsiiValVal 
2 683 GTCGGATGGCTGGTCCAGGGCTTCTTCCCCCAGGAGCCACTCAGTGTGACCTGGAGCGAA 
VaIAlaCysLeuValGlnGlyPhePheProGlnGluProl>euSerVaiThrTrpSerGlu 
2743 AGCGGACAGAACGTGACCGCCAGAAACTTCCCACCTAGCCAGGATGCCTCCGGGGACCTG 
SerGlyGlnAsnValThrAlaArgAsnPheProProSerGlnAspAlaSerGlyAspLeu 
2 803 TACACCACGAGCAGCCAGCTGACCCTGCCGGCCACACAGTGCCCAGACGGCAAGTCCGTG 
TyrThrThrSerSerGlnlieuThrLeuProAlaThrGlnCysProAspGlyLysSerVal 
2863 ACATGCCACGTGAAGCACTACACGAATTCCAGCCAGGATGTGACTGTGCCCTGCCGAGTT 

ThrC^sHisValLysHisTyrThrAsnSerSerGlnAspValThrValProCysArgV 
2923 CCCCCACCTCGCGCATGCTGCGACCCCCGACTGTCGCTGCACCGACCGGCCCTCGAGGAC 
ProProProProProCysCysHisProArgLeuSerLeuHisArgProAlaLeuGluAsp 
2983 CTGCTCTTAGGTTCAGAAGCGAACCTCACGTGCACACTGACCGGCCTGAGAGATGCCTCT 
LeuLeuLeuGlySerGluAlaAsnieuThrCysThrLeuThrGlyLeuArgAspAlaSer 
3043 GGTGCCACCTTCACCTGGACGCCCTCAAGTGGGAAGAGCGCTGTTCAAGGACCACCTGAG 
GlyAlaThrPheThrTrpThrProSerSerGlyliysSerAlaValGlnGlyProProGlu 
3 103 CGTGACCTCTGTGGCTGCTAGAGCGTGTCCAGAGTACTTCCTGGCTGTGCCCAGCCATGG 
ArgAspLeuCysGlyCyeTyrSerValSerArgValLeuProGlyCyBAlaGlnProTrp 
3163 AACCATGGGGAGACCTTCACCTGCACTGCTGCCCACCCCGAGTTGAAGACCCCACTAACC 
AsnHisGlyGluThrPheThrCysThrAlaAlaHisProGluLeuLysThrProLeuThr 
3223 GCCAACATCACAA^TCCGGAAACAGATTCCGGCCCGAGGTCCACCTGCTGCCGCCGCCG 
AlaAsnlleThrLysSerGlyAsnThrPheArgProGluValHisLeuLeuProProPro 
TCGGAGGAGCTGGCCCTGAACGAGCTGGTGACGCTGACGTGCCTGGCACGTGGCTTCAGC 
SerGluGluLeuAlaLeuAsnGluLeuValThrLeuThrCysLeuAlaArgGlyPheSer 
CCCAAGGATGTGCTGGTTCGCTGGCTGCAGGGGTCACAGGAGCTGCCCCGCGAGAAGTAC 
ProLysAspValLeuValArgTrpLeuGlnGlySerGlnGluLeuProArgGluLysTyr 
3403 CTGACTTGGGCATCCCGGCAGGAGCCCAGCCAGGGCACCACCACCTATGCTGTGACCAGC 
LeuThrTrpAlaSerArgGlnGluProSerGlnGlyThrThrThrTyrAlaValThrSer 
3463 ATACTGCGCGTGGCAGCCGAGGACTGGAAGAAGGGGGAGACCTTCTCCTGCATGGTGGGC 
IlelieuArgValAlaAlaGluAspTrpLysLysGlyGluThrPheSerCysMetValGly 
3523 CACGAGGCCCTGCGGCTGGCCTTCAGACAGAAGACCATCGACCGGTTGGCGGGTAAAGCC 

. HisGliiAlaLeuProbeuAlaPheThrGlnLysThrlleAspArgLeuAlaGlyLysPro ■ 
3583 ACCCATATCAATGTGTCTGTTGTGATGGGGGAGGCGGACGGGACCTGCTACAGATCTGAA 
ThrHisIleAsnValSerValValMetAlaGluAlaAspGlyThrCysTyrArgSerGlu 
junEcoRI marker 
3643 AAGGATGAAC1T 
LysAspGluLeu 



3283 
3343 



AAGGATGAACTTTAGAATTCGATATCAAGCTAAT^ 
LvsAsnGluLeu 



3718 TOAATCCTQTTQCCGGTCTTGCC^TQATTATCATATAAtTTCTGTTGAATTACQTTAAGCATC^ 
3798 TCGATCACGTTATTTATGAGATGGGTTTTTATGATTAGAGTCCCGCAA1TATACATOT 

3878 ATAGCGCGCAAACTAGGATAAATTATCGCGCGCGGTGTC^TCTATGTTACTAGATCGGGGATGTGCCXSGTCTCCOT 
3958 TGAGTCGTATTAATTTCGATAAGCCAGGTTAACCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTAT 
4038 TGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTG^ 

4118 AGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCA^ 
4198 AACCGTAAAAAGGCCGCGTTCCTOGCGTTTTTCCA^^ 

4278 TCAGAGGTG^CGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCrGGAAGCTCCCTCGTGCGCT 
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'ACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTA 



llll ^ACCCTGCCGCTTACCGGATACCTGTC 
4438 

ATCCGGTAACTATGGTCTTGAGTCGAACCCGGTAAGAGACGACTTAT xwi^uuu^ceCTGCGCCTT 

GCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGC 
GGTATCTGGGCTGTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGC 
TAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAA? 
CGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTG 

atccttttaaattaaaaatgaagttttaaatcaatctaaagtatat; 
aatcagtgaggcacctatctcagcgatctgtctatttcgttcatcc? 
cgatacgggagggcttaccatctggccccagtgctgcaatgataccg 

co1 0 gc^taaaccagccagccggaagggccgagcgcagaagtggtcctgcaactttatccgcctccatccagtcta^ 
5238 ttggcgggaaggtagagtaagtagttcxsccagttaatagtttgggcmcgttgi ° oCCTCCATCCAGTCTATT ^g 



4518 ATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAa 

TGAAG 
kAAAG 
3CGCA 
^AGGGj 
TAAAG' 
TCGTO 

5158 



™ AGCGAGGTATGTA ^ 

'AGTTGCCTGACTCCCCGTCGTGTAGATAACTA 



AAAAAAGCGGOTAGCTCCOT ^^ 
™^ AATTCTCTOACTGTCATGCCATCCST ^ 
GAG ^™ A ^ OTGCGACCGAGTTCCTCTTG ^ 

ACCraGTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGeGT^^ 
^GCAAAAAAGGGAATAAGGGCGACACGG^ 



5558 



5958 l^^r^l^ 
III* n^™^ LGTCCCACCTOACGTCTAAG ^ G CATTA^^ 

^™? CTCGCGCGTTO ^ 

6118 TAAGCGGATGCCGGGAGCAGACAAGCCGGTCKGGGCGCGTCAGCGGGTGraGGCGGGTGTC 



Ndel 

«1 SSSS£^^ BM ^ 
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Bean Legumin Signal Peptide 
MetSerLysProPheLeuSerLeuLeuSerLeuSerLeuLeuLeuPheThrSerthrCysLeuAia 
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Nucleotide and Amino Acid Sequence of Protein Coding Region of 

pSHuJ 

1141 

AGGATCTATCGATTCCCGGGTACC ATG GAG AAC CAT TTG CTT TTC TGG GGA GTC CTG GCG 
1201 /i3 ™ et glu asn his !eu leu phe trp gly val leu ala 

GTT TTT ATT AAG GCT GTT CAT GTG AAA GCC CAA GAA GAT GAA AGG ATT GTT CTT GTT GAC 
I26I/3I yS 3 ^ biS Val lyS ala 9ln glu as ? glu ar 9 He val . leu val asp 

AAC AAA TGT AAG TGT GCC CGG ATT ACT TCC AGG ATC ATC CGT TCT TCC GAA GAT CGT AAT 
iSSl/S ^ ^ Ue thr Ser arg ile ile arg se * ser Siu asp pro asJ 

GAG GAC ATT GTG GAG AGA AAC ATC CGA ATT ATT GTT CCT CTG AAC AAC AGG GAG AAT ATC 
X381/7? ar9 ile ±le Val Pr ° 1SU 3311 asn ar 9 9 lu 

TCT GAT CCC ACC TCA CCA TTG AGA ACC AGA TTT GTG TAC CAT TTG TCT GAC CTC TGT AAA 
1441/93 ?r ° l6U tht ar3 Phe Val tyr hi * leu ser as P l^u cys lys 

AAA TGT GAT CCT ACA GAA GTG GAG CTG GAT AAT CAG ATA GTT ACT GCT ACC CAG AGC AAT 
i50lAl3 aSP 3lU Val 3lu leu as P aan 9. ln ile ^1 thr ala thr gin ser asn 

ATC TGT GAT GAA GAC AGT GCT ACA GAG ACC TGC TAC ACT TAT GAC AGA AAC AAG TGC TAC 
lSsl/iS 3 aSP 9 aSP ala 9lU thr ° YS tyr thr tyr aSp arg asn IV s tyr 
ACA GCT GTG. GTC CCA CTC GTA TAT GGT GGT GAG ACC AAA ATG GTG GAA ACA GCC TTA ACC 
.SSl/Ki Pr ° U tyr 9ly gly glu thr ^ met ^1 9lu thr ala leu thr 

CCA GAT GCC TGC TAT CCT GAC TGA ATTC 
pro asp ala cys tyr pro asp 
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Nucleotide and Amino Acid Sequence of Protein Coding Region of 

pSHuSC- 

1137 

GTCGATTCCCGGGTACC ATG GTG CTC TTC GTG CTC ACC TGC 
117 g /g met val leu phe val leu thr cys 

CTG CTG GCG GTC TTC CCA GCC ATC TCC ACG AAG AGT CCC ATA TTT GGT CCC GAG GAG GTG 
^238/29 P Pr ° ±le 36r thr lvs ser pro ile gly pro glu glu val 

AAT AGT GTG GAA GOT AAC TCA GTG TCC ATC ACG TGC TAG TAC CCA CCC ACC TCT GTC AAC 
1298/49 9 3 7 3811 Val Ser ' ±lB thr CyS tyr tyr pro pro thr ser val atn 
CGG CAC ACC CGG AAG TAG TGG TGC CGG CAG GGA GCT AGA GGT GGC TGC ATA ACC CTC ATC 

31 8/69 ar9 lys tyr trp cys axa 9ln gly ala ar ^ cy* ile *5 SS S 

TCC TCG GAG GGC TAC GTC TCC AGC AAA TAT GCA GGC AGG GCT AAC CTC ACC AAC TTC CCG 
SSlS/S U 9 y ^ V ^ ^ 3la 9ly ar3 ala aSn leu thr asn S pro 

GAG AAC GGC ACA TTT GTG GTG AAC ATT GCC CAG CTG AGC CAG GAT GAC TCC GGG CGC TAC 
fi?8/109 9 P ^ ^ ala 9in l6U Ser 9ln aS£> aSp Ser 9 lv ar 9 ty^ 

AAG TGT GGC CTG GGC ATC AAT AGC CGA GGC CTG TCC TTT GAT GTC AGC CTG GAG GTC AGC 
1538/^9 9 7 lU 9ly l6U S6 ^ Phe aSP Val 3Sr 16U 9 lu val 

CAG GGT CCT GGG CTC CTA AAT GAC ACT AAA GTC TAC ACA GTG GAC CTG GGC AGA ACG GTG 
1598/l Y 9 Pr ° 9 Y l6U l6U aSp thr lyS val ^ val asp leu gly arg tS vS 
ACC ATC AAC TGC CCT TTC AAG ACT GAG AAT GCT CAA AAG AGG AAG TCC TTG TAC AAG CAG 

S58A69 asn cys pro phe lys - hr 9 - u asn ala gln lys ars lyB ?er leu tvr ^ S» 

ATA GGC CTG TAC CCT GTG CTG GTC ATC GAC TCC AGT GGT TAT GTG AAT CCC AAC TAT ACA 
1718 ^ Pr ° Val l6U Val ile asp 9er ser ^r val asn pro Jsn tS 

GGA AGA ATA CGC CTT GAT ATT CAG GGT ACT GGC CAG TTA CTG TTC AGC GTT GTC ATC AAC 
?} y 8/2?9 ar9 3SP 9ln gly thr gly gln leU led phe ser val ^1 ile asn 
CAA CTC AGG CTC AGC GAT GCT GGG CAG TAT CTC TGC CAG GCT GGG GAT GAT TCC AAT AGT 
1838/229 ar9 aSP 313 9ly 9ln tyr leU CVS 9 ln ala S*Y asp asp ser asn ser 

AAT AAG AAG AAT GCT GAC CTC CAA GTG CTA AAG CCC GAG CCC GAG CTG GTT TAT GAA GAC 
1398/249 YS aSn ala aSP l6U 9ln Val 1SU ly * pro 9 lu pro 9lu leu val tyr. gtJ asp 
CTG AGG GGC TCA GTG ACC TTC CAC TGT GCC CTG GGC CCT GAG GTG GCA AAC GTG GCC AAA 
«58/2 9 9 9 7 CyS 313 l6U 9ly Pr ° 3lu Val ala asn val ala iyt 

TTT CTG TGC CGA CAG AGC AGT GGG GAA AAC TGT GAC GTG GTC GTC AAC ACC CTG GGG AAG 

! o! 8 /289 CyS ar9 9 SSr Ser gly glu . asn cys asp val val val asn thr leu 9iy lys 

AGG GCC CCA GCC TTT GAG GGC AGG ATC CTG CTC AAC CCC CAG GAC AAG GAT GGC TCA TTC 
20?8/3 a 9 PrO 9lU ar9 116 lEU l6U aSn Pr ° 91- ^p asp gly ser J£ 

AGT GTG GTG ATC ACA GGC CTG AGG AAG • GAG GAT GCA GGG CGC TAC CTG TGT GGA GCC CAT 
2138 V 329 5 ^ 917 ar9 ^ 9lU aSP ala 9^ a rg tyr leu cys gly afa £s 

TCG GAT GGT CAG CTG CAG GAA GGC TCG CCT ATC CAG GCC TGG CAA CTC TTC GTC AAT GAG 
Il98 a 349 9 7 9 9ln 9ly Pr ° 9ln 313 trP 9ln ieU Ph « ™* ^ Su 

GAG TCC ACG ATT CCC CGC AGC CCC ACT GTG GTG AAG GGG GTG GCA GGA AGC TCT GTG GCC 
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glu ser thr lie pro arg ser 
2258/369 

GTG CTC TGC CCC TAG AAC CGT 
val leu cys pro tyr asn arg 
2318/389 

GGG GCC CAG AAT GGC CGC TGC 
gly ala gin asn gly arg cys 
2378/409 

TAC GAG GGC CGC CTC TCC CTG 
tyr glu gly arg leu ser leu 
2438/429 

AAC CAG CTC ACC AGC CGG GAC 
asn gin leu thr ser arg asp 
2498/449 

TGG AGG ACC ACC GTG GAG ATC 
trp arg thr thr val glu ile 
2558/469 

AAT GTC ACG GCT GTG CTG GGA 
asn val thr ala val leu gly 
2618/489 

TCC TCG TAC GAG AAA TAC TGG 
ser ser tyr glu lys tyr trp 
2678/509' 

CAA GAC GAA GGC CCC AGC AAG 
gin asp glu gly pro ser lys 
2738/529 

CTG ACC CTG AAC CTG GTG ACC 
leu thr leu asn leu val thr 
2798/549 

GGC CAC TTC TAT GGA GAG ACT 
gly his phe tyr gly glu thr 
2858/569 

GGG TCC CGC GAT GTC AGC CTA 
gly ser arg asp val ser leu 
'2918/589 

TCT GGT TTT CGG GAG ATT GAG 
ser gly phe arg glu ile glu 
2978 
ATTC 



pro thr val val lys gly 

AAG GAA AGC AAA AGC ATC 
lys glu seir lys ser ile 

CCC CTG CTG GTG GAC AGC 
pro leu leu val asp ser 

CTG GAG GAG CCA GGC AAC 
leu glu glu pro gly asn 

GCC GGC TTC TAC TGG TGT 
ala gly phe tyr trp cys 

AAG ATT ATC GAA GGA GAA 
lys ile ile glu gly glu 

GAG ACT CTC AAG GTC CCC 
glu thr leu lys val pro 

TGC AAG TGG AAT AAC ACG 
cys lys trp asn asn thr 

GCC TTC GTG AAC TGT GAC 
ala phe val asn cys asp 

AGG GCT GAT GAG GGC TGG 
arg ala asp glu gly trp 

GCA GCC GTC TAT GTG GCA 
ala ala val tyr val ala 

GCG AAG GCA GAC GCT GCT 
ala lys ala asp ala ala 

AAC AAA GCC ATT CAG GAT 
asn lys ala ile gin asp 



val ala gly ser ser val ala 

AAG TAC TGG TGT CTC TGG GAA 
lys tyr trp cys leu trp glu 

GAG GGG TGG GTT AAG GCC CAG 
glu gly trp val lys ala gin 

GGC ACC TTC ACT GTC ATC CTC 
gly thr phe thr val ile leu 

CTG ACC AAC GGC GAT ACT CTC 
leu thr asn gly asp thr leu 

CCA AAC CTC AAG GTT CCC GGG 
pro asn leu lys val pro gly 

TGT CAC TTT CCA TGC AAA TTC 
cys his phe pro cys lys phe 

GGC TGC CAG GCC CTG CCC AGC 
gly cys gin ala leu pro ser 

GAG AAC AGC CGG CTT GTC TCC 
glu asn ser arg leu val ser 

TAC TGG TGT GGA GTG AAG CAG 
tyr trp cys gly val lys gin 

GTT GAA GAG AGG AAG GCA GCG 
val glu glu arg lys ala ala 

CCT GAT GAG AAG GTG CTA GAC 
pro asp glu lys val leu asp 

CCC AGG CTT TTT GCA GAG TGA 
pro arg leu phe ala glu 
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1 CTGGCCCKSCGCCAGA^ 

81 TTAAATATCCGATTATTCTAATAAACGCTCTTTTCTCTTAGGTTTACCCGCCAATATAT^ 
■ 161 AAACTGAAGGCGGGAAACGACAATCTGATCAT^GCGGAGAATTAAGGGAGTCACGTTATGACCCCCGCCGAT^ 
241 GACAAGCCGTTTTACGTTTGGAACTGACAGAAGCGCAACGATTGAAGGAGCCAG 
321 CTAGTAACATAGATGACACGGCGCGCGATAATTOATCCTAGTTTGCGCGCTATATTTTQTTTTCTATC^ 
401 TATAATTGOGGGACTCTAATCATAAAAACCCATCTCATAAATAACGTCATGCATTAC^TGTTAATTATTACAT 
481 GTAATTCAACAGAAATTATATGATAATCATCGCAAGACCGGCAACAGGATTCAATCTTAAGAAAC 

Xmal 

Sacl . NotI srtial 

561 raGAACGATCGGGGAAATTCGAGCTCCACCGCGGTGGCGGCCGCTGTAGAACITAGTGGATCCCCCGGGC 

Sail Xhol Kpnl 

641 GATCAGATCTGATCAAGCTTATCGATACCGTGGACCTCGAGGGGGGGCCCGGTACCCGTAGAGTCGATTTGGTGTATCGA ' 
721 GATTGGTTATGAAATTCAGATGCTAGTGTAATGTATTGGTAATTTGGGAAGATATAATAGGAAGCAAGGCTAXTTATCCA 
801 TTTCTGAAAAGGCGAAATGGCGTCACCGCGAGCGTCACGCGCATtCCGTTCTrGGTGTAAAGCGTtGTTTC 
881 TGACTAGGGAGGCTTGGCGTGTCAGCGTATCTATTCAAAAGTCGTTAATGGCTGGGGATCAAGAAAAAGTTGGAATAGAA 

961 acagaatacccgcgaaattcaggcccggttggcAtgtcctacacgccgaaataaacgaccaaattagtag^ ' 

1041 ACTGACTCGGATACTTACGTCACGTCTTQCGCACTGATTTGAAAAATCTCCCT^ 

BamHI 

1121 T3CTTCAAAAGCAATGGGATTGACCAGCTCGCGGATCCTACAGGCCAAATTCGCTCTTAGCCGTACAATATTACTCACCG 
1201 GTGCGATGCXICCCCATGGTAGGTGAAGGTGGAAATTAATGATCCATCraGAGA^CACAGGCCCACIA^CAGCTACCAGTTT 
12 81 CCTCAAGGGTCCACCAAAAAOSTAAGCGCTTACGTACATGGTCGATAAGAA 
BamHI 

1351 GTCGCTTTCAGGGATCCTACAGGCCAAATTCGCTCTTAGCCGTACAATATTACTCACCGGTGCGATGCCCCCCATCGTAG 
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1441 GTGAAGGTGGAAATTAATGATCCATCTTGAGACCAC^GGCCCACAAC 

BamHI 

1521 GTAAGCGCTTACGTACATGGTCGATAAGAAAAGGGAATTTGTAGATGT^ 
1601 GGCCAAATTCGCTCTTAGCCGTACAATATTACTCA^ 
1 681 CCATCTTGAGACCACAGGCCCACAAC&GCTACCAGTT^ 

BamHI 

1761 CGATAAGAAAAGGCAATTTGTAGATGTTAACATCCAACGTCGCTTTCAGGGATCC 
1841 TATAATAATTATATTTGTAAGAATATTATTATAATAATATAAAATATATATATATAAM 

1921 AATTATTAATAATATATATATTAATCATTTAGATATAT^ATTCTATAGCCTTAGACTCCTCATCAATAGAAGACTACGTA 
2001 TAAAAATAATCAGATAACATCTAAAAGATGTAGATAAATAATAGTTGTTTCA 

20 81 CTGCCGCAAGGACTCAGGGCGCAAGGGCTGCTAAAGGAAGCGGAACACGTAGAAAGCCAGTCCGCAGAANCGGTGCTGAC ' 

2161 CCCX3GATGAATGTCAGCTACTGGGCTATCTGGACAAGGGAA 

2241 GCTTACATGGCGATAGCTAGACTGGGCGGTTTTATGGACAGCAAGCGAACCOT 

2321 AGGTTGGGAAGCCCTGCAAAGTAAACTGGATGGCTTTCTTGCCGCCAAGGATCT^ < 
24 01 GCGGAGAATTAAGGGAGTCACGTTATGACCGCCGCCGATGACGCGGGACAAGCCGTTTTACGTTTGGAAG 
2481 GCAACGTTGAAGGAGCCACTCAGCCGCGGGTTTCTGGAGTTTAA^ ' 
2561 GTTCAAAAGTCGCCTAAGGTCACTATCAGCTAGCAAATATTTCTTGTC^AAAATGCTCCACTGACGTTC^TAAATTCC 
2641 CTCGGTATCCAATTAGAGTCTCATATTCACTCTCAATCCAGATCTGGATCGTTTCGCATGATTGAACAAGATGGATTGCA ' 

MetrieGluGlnAspGlyLeuHi 

2721 CGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTA^ 

sAlaGlySerProAlaAlaTjqpValGluArgLeuPheGlyT^ 
2801 CCGTGTTCCGGCTGTCAGCGCAGGGGGGCCCGGTTCTTTTTGTCAAGACCGACGTGTCCGGTGCCCTGAATGAACT 

laValPheArgLeuSerAlaGlnGlyArgProValLeuPheValLysThrAspLeuSerGlyAlaLeuAsnGluLeuGln 
2881 GACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTCTCACTGAAGCGGG '' 

AspGluAlaAlaArgLeuSerTrpLeuAlaThr^ 
2961 AAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTC 

yArgAspTrpLeuLeuLeuGIyGluValProG^ 
3041 TCATGGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATC 

leMetAlaAspAlaMetArgArgLeuHisThrLeuAspProAlaThrCysProPheAspHisGlnAlaLysHisArglle 
3121 GAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGC 

GluArgAlaArgThrArgMetGluAlaGlyLeuValAspGlnAspAspLeiiAspGluGluHis^ 
3201 CGAACTGTTCGCCAGGCTCAAGGCGGGCATGCCGGACGGCGATGATCTCGTCGTCACCCATGGGGATGCCTGCTTG 

aGluLeuPheAlaArgLeuLysAlaArgMetProAspGlyAspAspLeuValValThrHisGlyAspAlaCysLeuProA 
3281 ATATCATGGTGGAAAATGGCCGCTTTTCTGGAlTCATGGACTGTGGCCGGCTG 

snlleMetValGluAsnGlyArgPheSerGlyPhelleAspCysGlyArgLeuGlyValAlaAapArgTyrGlnAspIle 
3361 GCGTOGGCTACCCGTGATATTGCTGAAGAGCTTGGCGG 

AlaLeuAlaThrArgAspIleAlaGluGluLeuG^ 

BamHI 

3441 TCCGGATTCGCAGCGCATCGGGTTCTATCGGCTTCTTGACGAGTTCTTCTGAGCGGGACTCTGAGGATCGCCCGATGAGG 
aProAspSerGlnArglleAlaPlaeTyrArgLeuLeuAspGluPhePheo o o 

3521 TAAGCTAGCTATATCATCAATTTATGTATTACACATAATATCGCACTCAGTCTTTCAXCTACGGCAATGTACCAGCTGAT 
3601 ATAATCAGTTATTGAAATATTTCTGAATTTAAACTTGCATCAATAA . 
3681 TGTTATTTTATGAATAAATATTTAAACTATATTTCT^ 

3761 TGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAG 
PVUI Bgll 

3841 AGGCCCGC^CCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGCCCGCTCCTT^ 

3921 CGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACC 
4001 TGGACCCCAAAAAACTTGATTTGGGTGATGGTTCACGTAGTGGGCCATCGCGCTGATAGACGGTTT 
4081 TTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTC 
4161 TTTATAAGGGATTTTGCCGATrTCG^ 

4241 GCTGCAACTCTCTCAGGGCCAGGCGGTGAAGGGCAATCAGCTGTTGCCCGTCTCACTGGTGAAAAGAAAAACCACCCCAG 
4321 TACATTAAAAACGTCCGCAATGTGTTATTAAGTC^ 

4401 CAGCCAACAGCTCCCCGACCGGCAGCTCGGCACAAAATCACCACTCGATAGAGGCAGCCCATCAG 
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1 CTGATGGGCTGCCTGTATCGAGTGGTGATTTTGTGCCGAGCTGCCGGTCGGGGAGCTGTTGGCTGGCTGGTGGCAGGATA 
8 1 TATTGTGGTGTAAACAAATTGACGCTTAGACAACTTAATAACAC^ 
161 TTTTCACCAGTGAGACGGGCAACAGCTGATTGCCCTTCACCGCCTGGCCCTGAGAGAGTTGCAGCAAGCGGTCCACGCTG 
241 GTTTGCCCC^GCAGGCGAAAATCCTGTTTGATGGTGGTTCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGCCC 
3 2 i GAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAA 
401 AACCGTCTATCAGGGGGATGGCCGACTACGTGAACC^TCACC 

481 TAAATCGteAACCCTAAAGGGAGCCC^ . 
Sfo I 

561 AAAGCGAAAGGAGCGGGCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATC^^ 
641 AGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTC^ 
Xmn I 

721 ACGGCCAGTGAATTAATTCCCATCTTGAAAGAAATATAGTTTAAATATTTATTGATAAAATAACAAGTCAGGTATTATAG 

Xmn I 

801 TCCAAGCAAAAACATAAATTTATTGATGCAAGTTTAAATT 
881 GCCGTAGATGAAAGACTGAGTGCGATATTATGTGTAAra^ 

961 CTCAGAGTCCCGCTCAGAAGAACTCGTCAAGAAGGCGATAGAAGGCGATGCGCTGCGAATCGGGAGCGGCGATACCGTAA 
o o ophePheGliiAspLeuLeuArgTyrPheAlalleArgGlnSerAspProAlaAlalleGlyTyrL 
1041 AGCACGAGGAAGCGGTCAGCCCATTCGCCGCCAAGCTCTTCAGCAATATCACGGGTAGCCAACGCTATGTCCTGATAGCG 
euValLeuPheArgAspAlaT^GluGlyGlylieuGluGluAlalleAapArgThrAlaLeuAlalleAspGlnTyrArg 
1121 GTCCGCCACACCCAGCCGGCCACAGTCGATGAATCCAGAAAAGCGGCCATTTTCCACCATGATArTCGGCAAGCAGGCAT 

AspAlaValGlyLeuArgGlyCysAspIlePheGlySerPheArgGlyAsnGluValMetrieAsnProLeu 
1201 CGCCATGGGTCACGACGAGATCATCGCCGTCGGGCATGCGCGCCTTGAGCCTGGCGAACAGTTCGGCTGGCGCGAGCCCC 
pGlyHisThrValValLeuAspAspGlyAspProMetArgAlaLysLeuArgAlaPheLeuGluAlaProAlaLeuGlyG. 
1281 TGATGCTCTTCGTCCAGATCATCCTGATCGACAAGACCGGCTTC 
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lnHisGluQluAspLeuAspAspGliiAspValLeuGlyAlaGluMetArgThrArgAlaArgGluIleArgHisLysAla 
1361 TTGGTGGTCGAATGGGCAGGTAGCCGGATC^GCGTATGCAGCCGCCGaATTGC^TCAGCCATGATGGATACTTTCTCGG 

GlnHi s AspPheProCy s ThrAl aProAspLeuThrHi sLeuArgArgMe t Al aAspAl aMet 1 1 eSerValLy sGluAl 
1441 CAGGAGCAAGGTGAGATGACAGGAGATCCTGCCCCGGCACTTCGCCCAATAGC AGCCAGTGCCTTCCCGCTTC AGTGACA 

aProAlaLeuHisSerSerLeuLeuAspGlnGlyProValGluGlyLeuLeuLeuTrpAspArgGlyAlaGluThrValV 

PstI 

1521 ACGTCGAGCACAGCTGCGCAAGGAACGCCCGTCGTGGCCAGCCACGATAGCCGCGCTGCCTCGTC 
alAspLeuValAlaAlaC^sProValGlyThrThrAlaLeuT:^ 

Sfo I 

1601 GGCACCGGACAGGTCGGTCTTGACAAAAAGAACCGGGCGCCCCTGCGCTGACAGCCGGAAGACGGCGGCATCAGAGCAGC 
AlaGlySerLeuAspThxIiysValPheLeuValProArgGlyGlnAlaSerLeuArgPheValAlaAlaAspSerCysGl 

1681 CGATTGTCTGTTGTGCCC^GTCATAGCCG^TAGCCTCTCCACCCAAGCGGCCGGAGAACCTGCGTGCAATCCATCTTGT 
ylleThrGlnGlnAlaTrpAspTyrGlyPheLeuArgGluValTrpAlaAlaProSerGlyAlaHisLeuGlyAspGlnG 
Bglll 

1761 TCAATCATGCGAAACGATCCAGATCTGGATTGAGAGTGAATATGAGACTGTAAnGGATACCGAGGGGAATTTATGGAAG 
luXleMet 

1841 GTC^GTGGAGCATTTTTGACAAGAAATATTTGCTAGCTGATAGTC 

SacII 

1921 CTGACGTATGTGCTTAGCTCATTAAACTCCAGAAACCCGCGGCTGAGTGGCTCCTTCAACGTTGCGGTTCTGTCAGTTCC 
2001 AAACGTAAAACGGCTTGTCCCGCGTCATCGGCGGGGGTCATAACGTGACTCCCTTAATTCTCCGCTCATGATCTTGATCC 

Sfo I 

2081 CCTGCGCCATCAGATCCTTGGCGGCAAGAAAGCCATCCAGTTTACTO 

2161 CAGCTGGCAATTCGGGTTCGCTTGCTGTCCATAAAACCGCCCAGTCTAGCTATCGCCATGTAAGCCCACTGCAAGCTACC 
2241 TGCTTTCTCTTTGCGCTTGCGTTTTCCCTTGT^^ 

2321 CGGACTGGCTTTCTACGTGTTCCGCTTCCTTTAGCAGCCCTTGCGCCCTGAGTGCTTGCGGCAGCGTGAAGCTCTGGACA 
2401 TCATGTTGGATATGAAACAACTATTATTTATCTACATGTTTTAGATGTTATCTGATTATTTTTATACGTAGTCTTCTATT 
2481 GATGAGGAGTCTAAGGCTATAGAATTATATATCTAAATGATTAATATATATATTATTAATAATTAACAATAATTAATATA 

Nrul 

2561 TTATAATTTATATATATATATTTTM 

2641 TAAGCTCGCGGATCCCTJ3AAAGCGACGTTGGATGTTAACM 

2721 CTTACGTCTTTGGTGGACCCTTGAGGAAACTGGTAGCTGTTGTGGGCCTGT^ 

2801 TTCACCTAGGATGGGGGGCATCGCACCGGTGAGTAATATTGTACGGCTAAGAGCGAATTTGGCCTGTAGGATCCCTGAAA 
2881 GCGACGTTGGATGTTAACATCTACAAATTGCCTTTTCTTA^ 

2961 TGAGGAAACTGGTAGCTGTTGTGGGCCTGTGGTCTCAAGATGGATCATTAATTTCCACCTTCACCTACGATGGGGGGCAT 

3041 CGCACCGGTGAGTAATATTGTACGGCTAAGAGCGAATTTGGCCTGTAGGATCCCTGAAAGCGACGTTGGATGTTAACATG 

3121 TACAAATTGCCTTTTCTTATCGACCATGTACGTAAGCGCTTACGTTTTTGGTGGACCCTTGAGGAAACTGGTAGCTGTTG 

3201 TGGGCCTGTGGTCTCAAGATGGATCATTAATTTCCACCTTC 

3281 ACGGCTAAGAGCGAATTTGGCCTGTAGGA/TCCGCGAGCTGGTCAATC^ 

3361 TTTCTCGATCGAGGGAGATTTTTCAAATCAGTGCGCAAGACGTGACGTAA^^ 

3441 ATTTGGTCGTTTATTTCGGCGTGTAGGACATGGCAACCGGGCCTGA ATTTCGCGGGTATTCTGTTTCTATTCCAACTTTT 
3521 TCTTGATCCGCAGCGATTAACGACTTTTGAATAGATACGCTGACACGCCAAGCCTCG 

3601 CGCTTTACAGCAAGAACGGAATGCGCGTGACGCTCGCGGTGACGCCATTtCGCCTTTXCAGAAATGGATAAATAGCCTTG 
3681 CTTCCTATTATATCTTCCCAAATTACCAATAC^TTACACTAGC^^ 
Kpnl 

3761 ACTCTAGGGGTACCATGGTGCTCTTCGTGCTCACCTGCCTGCTGGCGGTCTTCCCAGCCATC 
MetValLeuPheValLeuThrCysLeuLeuAlaValPheProAlalle 
3823 TCCACGAAGAGTCCCATATTTGGTCCCGAGGAGGTGAATAGTGTGGAAGGTAACTCAGTG 

SerThrLysSerProIlePheGlyProGluGluValAsnSerValGluGlyAsnSerVal 
3883 TCCATCACGTGCTACTACCCACCCACCTCTGTCAACCGGCACACCCGGAAGTACTGGTGC 

Ser 1 1 eThrCysTyrTyrProProThrS erValAsnArgHi sThrArgLys TyrTrpCy s 
3943 CGGCAGGGAGCTAGAGGTGGCTGCATAACCCTCATCTCCTCGGAGGGCTACGTCTCCAGC 

ArgGlnGlyAlaArgGlyGlyCysIleThrLeuIleSerSerGluGlyTyrValSerSer 
4003 AAATATGCAGGCAGGGCTAACCTCACCAACTTCCCGGAGAACGGCACATTTGTGGTGAAC 

LysTyrAl aGl yArgAl aAsnJjeuThrAsnPheProGluAsnGlyThrPheVal Va lAsn 
4063 ATTGCCCAGCTGAGCCAGGATGACTCCGGGCGCTACAAGTGTGGCCTGGGCATCAATAGC 

IleAlaGlnLeuSerGlnAspAspSerGlyArgTyrLysCysGlyLeuGlylleAsiiSer 
4123 CGAGGCCTGTCCTTTGATGTCAGCCTGGAGGTCAGCCAGGGTCCTGGGCTCCTAAATGAC 
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ArgGlyLeuSerPheAspValSerLeuGluValSerGlnGlyProGlyLeuLeuAsnAsp 
4183 ACTAAAGTCTACACAGTGGACCTGGGCAGAACGGTGACCATCAACTGCCCTTTCAAGACT 

ThrLysValTyrThrValAspLeuGlyArgThrValThrlleAsnCysProPheLysThr 
4243 GAGAATGCTCAAAAGAGGAAGTCCTTGTACAAGCAGATAGGCCTGTACCCTGTGCTGGTC 

GluAsnAlaGlnLyBArgLysSerLeuTyrLysGlnlleGlyLeuiyrProValLeuVal 
4303 ATCGACTGCAGTGGTTATGTGAATCCCAACTATACAGGAAGAATACGCCTTGATATTGAG 

IleAspSerSerGlyTyrValAsnProAsnTyrThrGlyArglleArgLeuAspIl 
4363 GGTACTGGCCAGTTACTGTTCAGCGTTGTCATCAACCAACTCAGGCTCAGCGATGCTGGG 

GlyThrGlyGlnLeuLeuPheSerValVallleAsnGXnLeuArgLeuSerAspAlaGly 
4423 CAGTATCTCTGCCAGGCTGGGGATGATTCCAATAGTAATAAGAAGAATGCTGACCTCCAA 

GlnTyrLeuGysGlnAlaGlyAspAspSerAsnSerAsnLysLysAsnAlaAspLeuGln . 
4483 GTGCTAAAGCdCGAGCCCGAGCTGGTTTATGAAGACCTGAGGGGCTCAGTGACCTTCCAC 

ValLeuLysProGluProGluLeuValTyrGluAspLeuArgGlySerValThrpheHis 
4543 TGTGCCCTGGGGCCTGAGGTGGCAAACGTGGCGAAATTTCTGTGCCGACAGAGCAGTGGG V 

Cy3AlaLeuGlyProGluValAlaAsnValAlaLysPheLeuCysArgGlnSerSerGly 
4603 GAAAACTGTGACGTGGTCGTCAACACCCTGGGGAAGAGGGCCCCAGCCTTTGAGGGCAGG ^ 

GluAsnGysAspValValValAsnThrLeuGlyLysArgAlaProAlaPheGluGlyArg 
4663 ATCCTGCTCAACCCCCAGGACAAGGATGGCTCATTCAGTGTGGTGATCACAGGCCTGAGG 

IleLeuLeuAsnProGlnAspLysAspGlySerPheSerValVallleThrGlyLeuArg 
4723 AAGGAGGATGCAGGGCGCTACCTGTGTGGAGCCCATTCGGATGGTCAGCTGCAGGAAGGC 

LysGliiAspAlaGlyArgTyrLeuCysGlyAlaHisSerAspGlyGlnLeuGlnGluGly 
4783 TCGCCTATCCAGGCCTGGCAACTCTTCGTCAATGAGGAGTCCACGATTCCCCGCAGCCCC 

SerProIleGlnAlaTrpGlnLeuPheValAsnGluGluSerThrlleProArgSerPro 
4843 ACTGTGGTGAAGGGGGTGGCAGGAAGCTCTGTGGCCGTGCTCTGCCCCTACAACCGTAAG 

ThrValValLysGlyValAlaGlySerSerValAlaValLeuCysProTyrAsnArgLys 
4903 GAAAGCAAAAGCATCAAGTACTGGTGTCTCTGGGAAGGGGCCCAGAATGGCCGCTGCCCC 

GluSerLysSerlleLysTyrTrpCysLeuTrpGluGlyAlaGlnAsnGlyArgCysPro 
4963 CTGCTGGTGGACAGCGAGGGGTGGGTTAAGGCCCAGTACGAGGGCCGCCTCTCCCTGCTG 

LeuLeuValAspSerGluGlyTrpValLysAlaGlnTyrGluGlyArgLeuSerLeuLeu 
5023 GAGGAGCCAGGCAACGGCACCTTCACTGTCATCCTCAACCAGCTCACCAGCCGGGACGCC 

GluGluProGlyAsnGlyThrPheThrVallleLeuAsnGlnLeuThrSerArgAspAla 
5083 GGCTTCTACTGGTGTCTGACCAACGGCGATACTCTCTGGAGGACCACCGTGGAGATCAAG 

GlyPheTyrTrpCysLeuThrAsnGlyAspThrLeuTrpArgThrThrValGluIleLys 
5143 ATTATCGAAGGAGAA.CCAAACCTCAAGGTTCCCGGGAATO 

IlelleGluGlyGluProAsnLeuLysValProGlyAsnValThrAlaValLeuGlyGlu 
5203 ACTCTCAAGGfCCCCTGTCACTTTCCATGCAAATTCTGCTCGTACGAGAAATACTGGTGC 

ThrLeuLysValProCysHisPheProCysLysPheSerSer^rGluLysTYrTrpCys 
5263 AAGTGGAATAACACGGGCTGCCAGGCCCTGGCCAGCCAAGAGGAAGGCCCCAGGAAGGCC 

LysTrpAenAsnThrGlyCysGlnAlaLeuProSerGlnAspGluGlyProSerLysAla 
5323 TTCGTGAACTGTGACGAGAACAGCCGGCTTGTCTCCCTGACCCTGAAC CTGGTGACCAGG 

PheValAsnCysAspGliiAanSerArgLeuValSerLeuThrljeuAsnLeuValTiLrArg 
5383 GCTGATGAGGGCTGGTACTGGTGTGGAGTGAAGCAGGGCCACTTCTATGGAGAGACTGCA 

AlaAspGluGlyTrpTyrfrpCysGlyValLysGlnGlyHisPheTyrGlyGluTlirAla 
5443 GCCGTCTATGTGGCAGTTGAAGAGAGGAAGGCAGCGGGGTCCCGCGATGTCAGCCTAGCG 

AlaValTyrValAlaValGluGluArgLysAlaAlaGlySerArgAspValSerLeuAla 
5503 AAGGCAGACGCTGCTCCTGATGAGAAGGTGCTAGACTCTGGTTTTCGGGAGATTGAGAAC 

LysAlaAspAlaAlaProAspGluriysValLeuAspSerGlyPheArgGluIleGluAsn 

EcoRI 

5563 AAAGCCATTCAGGATCCCAGGCTTTTTGCAGAGTGAATTCCCGATC 

LysAlalleGlnAspProArgLeuPheAlaGlu 
5631 TTTCTTAAGATTCAATCCTGTTGGCGGTCTTGCGATGATTATCATATAATTTCTGTTGAATT 

5711 TTAACATX3TAATGCATGACGTTATTTATGAGATGGGTTTTTATGATTAGAGTCCCGCAATTATACATTTAATACGCGATA 
5791 GAAAACAAAATATAGCGCGC^AACTAGGATAAATTATCGCGCGCGGTGTCATCTATGTTACTA^ 
Clal 

5871 CGGTATCGATAAGGATCCCTGAi^GCGACGTTGGATGTTAACATCTACAAATTGCCTTTTCTTATCG ACCATGTACGTAA 
5951 GCGCTTACGTTTTTGGTGGACCCTTGAGGA 

6031 ACCTTCACCTACGATGGGGGGCATCGCACCGGTGAGTAATATTGTACGGCTAAGAGCGAATTTGGCCTGTAGGATCCCTG 
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5i « AAA f^ 6 " G ^ TGTTAA ^^ 

6191 COTGAGGAAACTGGTAGCTGTTGTGGGCCTGT^ 

^ G ^ ACCGG ™ GTAAT ^^^ 
6351 ATCraCAAMTOCCWTTC^ 

6511 TGTACGGCTAAGAGCGAATTTGGCCTGTAGGATCCGGGAGCTGGTCAATCCC^TTGCTTTT 
Xho I 

«tn £!^^ CGAGGG ^^^ 

fS« T^TCGTTTATTTCGGCGTGTAGGACATC^ 

6831 GCTTTACAGGAAGAACGGAATGCGCGTGACGCTCGCGGTGACGCCATTTCGTCTTTTC^ 
6911 TTCCTATTATATCTTCCCAAAXTACCAATACATOACACTAG^ 

AJDaX 

6991 CTCTAGAGGATCTAACCATGGGATCTAAACCTTTTT^ 

MetGlySerLysProPheLeuSerLeuLeuSerLeuSerLeuLeu 

Spel 

7053 TTGTTTACATCTACTAGTTTGGCACAAGAAGATGAAAGGATTGTTCTTGTTGACAACAAA 
LeuPheThrSerThrSerLeuAlaGlnGluAspGluArglleValLeuValAspAsnliys 
7113 TGTAAGTGTGCCGGGATTACTTCCAGGATCATCGGTTCTTCCGAAGATCGTAATGAGGAC 
CysLysCysAlaArglleThrSerArgllelleArgSerSerGluAspProAsnGluAsp 
7173 ATTGTGGAGAGAAACATCCGAATTATTGTTCCTCTGAACAACAGGGAGAATATCTCTGAT 
IleValGluArgAsnlleArgllelleValProLeuAsnAsnArgGluAsnlleSerAsp 
7233 CCCAC CTCACCATTGAGAAC C AGATTTGTGTACC ATTTGTCTGACCTCTGTAAAAAATGT 
ProThrSerProLeuArgThrArgPheValTyrHisLeuSerAspLeuCysLysLysCys 
GATCCTACAGAAGTGGAGCTGGATAATCAGATAGTTACTGCTACCCAGAGCAATATCTGT 
AspProThrGluValGluLeuAspAsnGlnlleValThrAlaThrGlnSerAsnlleCys 
GATGAAGACAGTGCTACAGAGACCTGCTACACTTATGACAGAAACAAGTGCTACACAGCT 
AspGluAspSerAlaThr^luThrCysTyrThrTyrAspArgAsnLysCysTyrThrAla 
GTGGTCCCACTCGTATATGGTGGTGAGACCAAAATGGTGGAAACAGCCTTAACCCCAGAT 

ValValProLeuValTyrGlyGlyGluThrLysMetValGluThrAlaLeuThrProAsp 
Sad 



7293 
7353 
7413 



74 73 GCCTGCTATCCTGACTGAGCTCGAATTTCCCCGATCGTTCAAACATTTGGCAATAAAGTTTCTTAAGATTGAAT 
AlaGysTyrProAspooo 

11V, ™;!P TGCCGGTCOTGCGA ^^ 

Inln 'SZ^™*^^ 

77«7 ^^"^^^ 

7787 CCTTCAATCGTTGOSGTTCTGTCAGTTC^^ 

llVn ^ WNCrCa ^^ 

7947 GGTAAACCTAAGAGAAAA^GOGTOTATTAG^ 



Bglll Sfo I 

8027 TTGTATGTGCATGCCAACCACAGGTTCCCCAGATCTGGCGCCGGCCAG 
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sequence Listing 
<110> planet biotechnology, incorporated 

Larrick, James William 
Wycof f , Keith Lynn 

<120> NOVEL IMMUNOADHES I ON . FOR THE PREVENTION 
' OF RHINOVIRUS INFECTION 



<130> 030905 . 0003 . WO 

<140> To be assigned 
<141> 2001-04-27 

<150> SO/200,298 
<151> 2000-04-27 

<160> 8 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1596 
<212> DNA 
<213> Homo Sapien 

<220> 

<221> CDS 

<222> (1) . . . (1596) 

<223> ICAM-1 

<400> 1 

atg get ccc age age ccc egg ccc gcg ctg ccc gca etc ctg gtc ctg 48 
Met Ala Pro Ser Ser Pro Arg Pro Ala Leu Pro Ala Leu Leu Val Leu 
1 5 10 15 

etc ggg get ctg ttc cca gga cct ggc aat gec cag aca tct gtg tec 96 
Leu Gly Ala Leu Phe Pro Gly Pro Gly Asn Ala Gin Thr Ser Val Ser 
20 25 30 

ccc tea aaa gtc ate ctg ccc egg gga ggc tec gtg ctg gtg aca tgc 144 
Pro. Ser Lys Val lie Leu Pro Arg Gly Gly Ser Val Leu Val Thr Cys 
35 40 ' 45 * 

age ace tec tgt gac cag ccc aag ttg ttg ggc ata gag ace ccg ttg 192 
Ser Thr Ser Cys Asp Gin Pro Lys Leu Leu Gly lie Glu Thr Pro Leu 
50 55 60 

cct aaa aag gag ttg etc ctg cct ggg aac aac egg aag gtg tat gaa 240 
Pro Lys Lys Glu Leu Leu Leu Pro Gly Asn Asn Arg Lys Val Tyr Glu 
65 70 75 80 

ctg age aat gtg caa gaa gat age caa cca atg tgc tat tea aac tgc 2 68 

Leu Ser Asn Val Gin Glu Asp Ser Gin Pro Met Cys Tyr Ser Asn Cys 
85 90 . 95 
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cct gat ggg cag tea aca get aaa acc ttc etc acc gtg tac tgg act 33 6 

Pro Asp Gly Gin Ser Thr Ala Lys Thr Phe Leu Thr Val Tyr Trp Thr 
100 . 105 110 

cca gaa egg gtg gaa ctg gca ccc etc ccc tct tgg cag cca gtg ggc 384 
Pro Glu Arg Val Glu Leu Ala Pro Leu Pro Ser Trp Gin Pro Val Gly 
115 120 125 

aag aac ctt acc eta cgc tgc cag gtg gag ggt ggg* gca ccc egg gec 432 
Lys Asn Leu Thr Leu Arg Cys Gin Val Glu Gly Gly Ala Pro Arg Ala 
130 135 140 

aac etc acc gtg gtg ctg etc cgt ggg gag aag gag ctg aaa egg gag 480 
Asn Leu Thr Val Val Leu Leu Arg Gly Glu Lys Glu Leu Lys Arg Glu 
145 150 155 160 

cca get gtg ggg gag ccc get gag gtc acg acc acg gtg ctg gtg agg 52 8 

Pro Ala Val Gly Glu Pro Ala Glu Val Thr Thr Thr Val Leu Val Arg 
165 170 - 175 

aga gat cac cat gga gee aat ttc teg tgc cgc act gaa ctg gac ctg 576 
Arg Asp His His Gly Ala Asn Phe Ser Cys Arg Thr Glu Leu Asp Leu 
180 185 190 

egg ccc caa ggg ctg gag ctg ttt gag aac acc teg gee ccc tac cag 624 
Arg Pro Gin Gly Leu Glu Leu Phe Glu Asn Thr Ser Ala Pro Tyr Gin 
195 200 205 

etc cag acc ttt gtc ctg cca gcg act ccc cca caa ctt gtc age ccc 672 
Leu Gin Thr Phe Val Leu Pro Ala Thr Pro Pro Gin Leu Val Ser Pro 
210 215 220 

egg gtc eta gag gtg gac acg cag ggg acc gtg gtc tgt tec ctg gac 72 0 

Arg Val Leu Glu Val Asp Thr Gin Gly Thr Val Val Cys Ser Leu Asp 
225 230 235 240 

ggg ctg ttc cca gtc teg gag gee cag gtc cac ctg gca ctg ggg gac 768 
Gly Leu Phe Pro Val Ser Glu Ala Gin Val His Leu Ala Leu Gly Asp 
245 250 255 

cag agg ttg aac ccc aca gtc acc tat ggc aac gac tec ttc teg gec 816 
Gin Arg Leu Asn Pro Thr Val Thr Tyr Gly Asn Asp Ser Phe Ser Ala 
260 265 270 

aag gee tea gtc agt gtg acc gca gag gac gag ggc acc cag egg ctg 864 
Lys Ala Ser Val Ser Val Thr Ala Glu Asp Glu Gly Thr Gin Arg Leu 
275 280 285 

acg tgt gca gta ata ctg ggg aac cag age cag gag aca ctg cag aca 912 
Thr Cys Ala Val He Leu Gly Asn Gin Ser Gin Glu Thr Leu Gin Thr 
290 295 300. 

gtg acc ate tac age ttt ccg gcg ccc aac gtg att ctg acg aag cca 960 
Val Thr He Tyr Ser Phe Pro Ala Pro Asn Val He Leu Thr Lys Pro 
305 310 315 320 
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gag gtc tea gaa ggg acc gag gtg aca gtg aag tgt gag g'cc cac cct 1008 
Glu Val Ser Glu Gly Thr Glu Val Thr Val Lys Cys Glu Ala His Pro 
325 330 335 



aga gec aag gtg acg ctg aat ggg gtt cca gec cag cca ctg ggc ccg 1056 
Arg Ala Lys Val Thr Leu Asn Gly Val Pro Ala Gin Pro Leu Gly Pro 
340 345 350 



agg gec cag etc ctg ctg 
Arg Ala Gin Leu Leu Leu 
355 

ttc tec tgc tct gca acc 
Phe Ser Cys Ser Ala Thr 
370 

aac cag acc egg gag ctt 
Asn Gin Thr Arg Glu Leu 
3 85 390 



aag gec acc cca gag gac 
Lys Ala Thr Pro Glu Asp 
360 

ctg gag gtg gec ggc cag 
Leu Glu Val Ala Gly Gin 
375 380 

cgt gtc ctg tat ggc ccc 
Arg Val Leu Tyr Gly Pro 
395 



aac ggg cgc age 1104 

Asn Gly Arg Ser 

365 

ctt ata cac aag 1152 
Leu lie His Lys 



cga ctg gac gag 1200 
Arg Leu Asp Glu 
400 



agg gat tgt ccg gga aac tgg acg tgg cca gaa aat tec cag cag act 1248 
Arg Asp Cys Pro Gly Asn Trp Thr Trp Pro Glu Asn Ser Gin Gin Thr 
405 * 410 415 

cca atg tgc cag get tgg ggg aac cca ttg ccc gag etc aag tgt eta 1296 
Pro Met Cys Gin Ala Trp Gly Asn Pro Leu Pro Glu 'Leu Lys Cys Leu . 
420 425 430 

aag gat ggc act ttc cca ctg ccc ate ggg gaa tea gtg act gtc act 1344 
Lys Asp Gly Thr Phe Pro Leu Pro lie Gly Glu Ser Val Thr Val Thr 
435 440 445 



cga gat ctt gag ggc acc tac etc tgt egg gee agg age act caa ggg 1392 
Arg Asp Leu Glu Gly Thr Tyr Leu Cys Arg Ala Arg Ser Thr Gin Gly 
450 455 460 

gag gtc acc cgc aag gtg acc gtg aat gtg etc tec ccc egg tat gag 1440 
Glu Val Thr Arg Lys Val , Thr Val Asn Val Leu Ser Pro Arg Tyr Glu 
465 470 475 "480 



att gtc ate ate act gtg gta gca gec gca gtc ata atg ggc act gca 14 88 
lie Val He lie Thr Val Val Ala Ala Ala Val He Met Gly Thr Ala 
485 490 495 ■ 



ggc etc age acg tac etc tat aac cgc cag egg aag ate aag aaa tac 1536 
Gly Leu Ser Thr Tyr Leu Tyr Asn Arg Gin Arg Lys lie Lys Lys Tyr 
500 505 510 

aga eta caa cag gec caa aaa ggg . acc ccc atg aaa ccg aac aca caa 1584 
Arg Leu Gin Gin Ala Gin Lys Gly Thr Pro Met Lys Pro Asn Thr Gin 
515 520 525 



gee acg cct ccc 1596 
Ala Thr Pro Pro 
530 
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<210> 2 

<211> 532 

<212> PRT 

<213> Homo Sapien 



<400> 2 














Met 


Ala 


Pro 


Ser 


Ser 


Pro Arg 


Pro Ala 


Leu Pro Ala Leu Leu Val Leu 


1 








5 






10 15 


Leu 


Glv 


Ala 


Leu 


Phe 


Pro Gly 


Pro Gly 


Aqn Ala Gin Thr Ser Val Ser 














25 


30 


Pro 


Ser 


Lys 


Val 


lie 


Leu Pro 


Arg Gly 


Gly Ser Val Leu Val Thr Cys 














40 


45 


Ser 


Thr 


Ser 


Cys 


Asp 


Gin Pro 


Lys Leu 


Leu Gly lie Glu Thr Pro Leu 




50 








55 




60 


Pro 


Lys 


Lys 


uxu 


Leu 


Leu Leu 


Pro Gly 


ZV^n Aqn Arcr Tivs Val Tvr Glu 


65 










70 




75 80 


Leu 


Ser 


Asn 




Gin Glu Asp 


ser uin 


Orn Mf»t* fVs Tvr Qf=»T* A^n Pvr 










85 








Pro 


Asp 


Giy 


Gin 


Ser 


Thr Ala 


Lys Thr 


Ph e> Leu Thr Va 1 Tvr Tim Thr 








iuu 








Tin 


Pro 


Glu 


Arg 


Val 


Glu 


Leu Ala 


Dfn T if^i i 

^ JL UCU 


PrcS Qer Ttti Gin Pro Val Glv 






11C 

-L X.D 








120 


125 


Lys 


Asn 


Leu 


Thr 


Leu 


Arg Cys 


Gin Val 


Glu Glv Glv Ala Pro Ara Ala 




130 








135 




140 


Asn 


Leu 


Thr 


Val 


Val 


Leu Leu 


Arg Gly 


Glu Lv^ Glu Leii Lvs Arcr Glu 


145 










150 




i i fin 

13j JL O U 


Pro 


Ala 


V ct J. 


gi v 


Glu 


Pro Ala 


Gin Val 

Ui Li. V Cl_L 


Thr* Thr* Thr* Val Leu Val Ara 










165 








A y-rt 


Asp 


Jtl-Lto 


XT -f a 


Gly Ala Asn 












i nn 

JLO VJ 








JL J y 


fLL 3 


Pro 


cm 


Gly 


Leu 


Glu Leu 


Phe Glu 


a o-n Thr* flpr Ala Pro Tvr Gin 














200 


205 


LeU 


Gin 


Thr 


Phe 


Val, 


Leu Pro 


Ala Thr 


Pro Pro Gin Leu Val Ser Pro 




210 








215 




220 


Arcr 


Val 


Leu 


Glu 


Val 


Asp Thr 


Gin Gly 


Thr Val Val Cys Ser Leu Asp 


225 










230 




235 240 


Glv 


Leu 


Phe 


Pro 


Val 


Ser Glu 


Ala Gin 


Val His Leu Ala Leu Gly Asp 










245 






250 255 


Gin 


Arcr 


Leu 


Asn 


Pro 


Thr Val 


Thr Tyr 


Gly Asn Asp Ser Phe Ser Ala 








260 






265 


270 


Lys 


Ala 


Ser 


Val 


Ser 


Val Thr 


Ala Glu 


Asp Glu Gly Thr Gin Arg Leu 






275 








280 


285 


Thr 


Cys 


Ala 


Val 


He 


Leu Gly 


Asn Gin 


Ser Gin Glu Thr Leu Gin Thr 




290 








295 




300 


Val 


Thr 


He 


Tyr 


Ser 


Phe Pro 


Ala Pro 


Asn Val lie Leu Thr Lys Pro 


305 










310 




315 32 0 


Glu 


Val 


Ser 


GlU 


Gly Thr Glu 


Val Thr 


Val Lys Cys Glu Ala His Pro 










325 






330 335 


Arg 


Ala 


Lys 


Val 


Thr 


Leu Ash 


Gly Val 


Pro Ala Gin Pro Leu Gly Pro 








340 






345 


350 


Arg' 


Ala 


Gin 


Leu 


Leu 


Leu Lys 


Ala Thr 


Pro Glu Asp Asn Gly Arg Ser 






355 








360 


365 


Phe 


Ser 


Cys 


Ser 


Ala 


Thr Leu 


Glu Val 


Ala Gly Gin Leu lie His Lys 




370 








375 




380 


Asn 


Glri 


Thr 


Arg 


Glu 


Leu Arg 


Val Leu 


Tyr Gly Pro Arg Leu Asp Glu 


385 










390 




395 400 


Arg 


Asp 


Cys 


Pro 


Gly, Asn Trp 


Thr Trp 


Pro Glu Asn Ser Gin Gin Thr 
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405 


410 










415 




Pro 


Met 


Cys 


Gin 


Ala Trp Gly Asn 


Pro Leu 


Pro 


Glu 


Leu 


Lys 


Cys 


Leu 








420 




425 








430 






Lys 


Asp 


Gly 


Thr 


Phe Pro Leu Pro 


He Gly 


Glu 


Ser 


Val 


Thr 


Val 


Thr 






435 




440 








445 








Arg 


Asp 


Leu 


Glu 


Gly Thr Tyr Leu 


Cys Arg 


Ala 


Arg 


Ser 


Thr 


Gin 


Gly 




450 






455 






460 










Glu 


Val 


Thr 


Arg 


Lys Val Thr Val 


Asn Val 


Leu 


Ser 


Pro 


Arg 


Tyr 


Glu 


465 








470 




475 










480 


lie. 


Val 


He 


He 


Thr Val Val Ala 


Ala Ala 


Val 


He 


Met 


Gly 


Thr 


Ala 










485 


490 










495 




Gly 


Leu 


Ser 


Thr 


Tyr Leu Tyr Asn 


Arg Gin 


Arg 


Lys 


lie 


Lys 


Lys 


Tyr 








500 




505 








510 






Arg 


Leu 


Gin 


Gin 


Ala Gin Lys Gly 


Thr Pro 


Met 


Lys 


Pro 


Asn 


Thr 


Gin 






515 




520 








525 








Ala 


Thr 


Pro 


Pro 



















530 



<210> 3 

<211> 3003 

<212> DNA 

<213> Homo Sapien 

<400> 3 

gctataagga tcacgcgccc cagtcgacgc fcgagctcctc tgctactcag agttgcaacc 60 

tcagcctcgc tatggctccc agcagccccc ggcccgcgct gcccgcactc ctggtcctgc 120 

tcggggctct gttcccagga cctggcaatg cccagacatc tgtgtccccc tcaaaagtca 180 

tcctgccccg gggaggctcc gfcgctggtga catgcagcac ctcctgtgac cagcccaagt 240 

tgttgggcat agagaccccg ttgcctaaaa aggagttgct cctgcctggg aacaaccgga 300 

aggtgtatga actgagcaat gtgcaagaag atagccaacc aatgtgctat tcaaactgcc 360 

ctgatgggca gtcaacagct aaaaccttcc tcaccgtgta ctggactcca gaacgggtgg 420 

aactggcacc cctcccctct tggcagccag tgggcaagaa ccttacccta cgctgccagg 480 

tSrgagggtgg ggcaccccgg gccaacctca ccgtggtgct gctccgtggg gagaaggagc 540 

tgaaacggga gccagctgtg ggggagcccg ctgaggtcac gaccacggtg ctggtgagga 600 

gagatcacca tggagccaat ttctcgtgcc gcactgaact ggacctgcgg ccccaagggc 660 

tggagctgtt tgagaacacc tcggccccct accagctcca gacctttgtc ctgccagcga 720 

ctcccccaca acttgtcagc ccccgggtcc tagaggtgga cacgcagggg accgtggtct 780 

gttccctgga cgggctgttc ccagtctcgg aggcccaggt ccacctggca ctgggggacc 840 

agaggttgaa ccccacagtc acctatggca acgactcctt ctcggccaag gcctcagtca 900 

gtgtgaccgc agaggacgag ,ggcacccagc ggctgacgtg tgcagtaata ctggggaacc 960 

agagccagga gacactgcag acagtgacca tctacagctt tccggcgccc aacgtgattc 1020 

tgacgaagcc agaggtctca gaagggaccg aggtgacagt gaagtgtgag gcccacccta 1080 

gagccaaggt gacgctgaat ggggttccag cccagccact gggcccgagg gcccagctcc 1140 

tgctgaaggc caccccagag gacaacgggc gcagcttctc ctgctctgca accctggagg 12 00 

tggccggcca gcttatacac aagaaccaga cccgggagct tcgtgtcctg tatggccccc 12 60 

gactggacga gagggattgt ccgggaaact ggacgtggcc agaaaattcc cagcagactc 1320 

caatgtgcca ggcttggggg aacccattgc ccgagctcaa gtgtctaaag gatggcactt 13 80 

tcccactgcc catcggggaa tcagtgactg tcactcgaga tcttgagggc acctacctct 1440 

gtcgggccag gagcactcaa ggggaggtca cccgcaaggt gaccgtgaat gtgctctccc 1500 

cccggtatga gattgtcatc atcactgtgg tagcagccgc agtcataatg ggcactgcag 1560 

gcctcagcac gtacctctat aaccgccagc ggaagatcaa gaaatacaga ctacaacagg 1620 

cccaaaaagg gacccccatg aaaccgaaca cacaagccac gcctccctga acctatcccg 1680 

ggacagggcc tcttcctcgg ccttcccata ttggtggcag tggtgccaca ctgaacagag 1740 

tggaagacat atgccatgca gctacaccta ccggccctgg gacgccggag gacagggcat 1800 

tgtcctcagt cagatacaac agcatttggg gccatggtac ctgcacacct aaaacactag ■ 1860 

gccacgcatc tgatctgtag tcacatgact aagccaagag gaaggagcaa gactcaagac 1920 

atgattgatg gatgttaaag tctagcctga tgagagggga agtggtgggg gagacatagc 1980 
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cccaccatga ggacatacaa ctgggaaata ctgaaacttg ctgcctattg ggtatgctga 2040 

ggccccacag acttacagaa gaagtggccc tccatagaca tgtgtagcat caaaacacaa 2100 

aggcccacac ttcctgacgg atgccagctt gggcactgct gtctactgac cccaaccctt 2160 

gatgatatgt atttattcat ttgttatttt accagctatt tattgagtgt cttttatgta 2220 

ggctaaatga acataggtct ctggcctcac ggagctccca gtccatgtca cattcaaggt 22 80 

caccaggtac agttgtacag gttgtacact gcaggagagt gcctggcaaa aagatcaaat 2340 

9999 ct 999 a cttctcattg gccaacctgc ctttccccag aaggagtgat ttttctatcg 2400 

gcacaaaagc actatatgga ctggtaatgg ttcacaggtt cagagattac ccagtgaggc 2460 

cttattcctc ccttcccccc aaaactgaca cctttgttag ccacctcccc acccacatac 2520 

atttctgcca gtgttcacaa tgacact cag cggtcafcgtc tggacatgag tgcccaggga 2580 

atatgcccaa gctatgcctt gtcctcttgt cctgtttgca tttcactggg agcttgcact 2640 

attgcagctc cagtttcctg cagtgatcag ggtcctgcaa gcagtgggga agggggccaa 2700 

ggtattggag gactccctcc cagctttgga agcctcatcc gcgtgtgtgt gtgtgtgtgt 2760 

atgtgtagac aagctctcgc tctgtcaccc aggctggagt gcagtggtgc aatcatggtt 2820 

cactgcagtc ttgacctttt gggctcaagt gatcctccca cctcagcctc ctgagtagct 2880 

gggaccatag gctcacaaca ccacacctgg caaatttgat tttttttttt tttttcagag 2940 

acggggtctc gcaacattgc ccagacttcc tttgtgttag ttaataaagc tttctcaact 3000 

gcc 3003 

<210> 4 

<211> 6 

<212> PRT 

<213> Homo Sapien 

<400> 4 

Ser Glu Lys Asp Glu Leu 
1 5 

<210> 5 

<211> 7 

<212> PRT 

<213> Horao Sapien 

<400> 5 

Arg Ser Glu Lys Asp Glu Leu 

1 5 . 

■<210> 6 
<211> 52 
<212> DNA 
<213> Homo Sapien 

<400> 6 

tctgttccca ggaactagtt tggcacagac atctgtgtcc ccctcaaaag tc 52 

<210> 7 

<211> 38 

<212> DNA 

<213> Homo Sapien 

<400> 7 

cataccgggg actagtcaca ttcacggtca cctcgcgg 38 

<210> 8 

<211> 799 

<212> PRT 

<213> Homo Sapien 
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<220> 

<221> CDS 

<222> (1) . . . (448)' 

<223> ICAM-1 Extracellular Domains 

<220> 
<221> CDS 

<222> (453) . . . (799) 
<223> Human IgA2m(2) 
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35 










40 








45 
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Lys 


Val 


Tyr 


Glu 


Leu 


Ser 


Asn 


Val 


Gin 


Glu 


Asp Ser Gin Pro 


Met 




50 
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60 
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Tyr 


Ser 


Asn 


Cys 


Pro 
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Gly 


Gin 


Ser 


Thr 


Ala Lys Thr Phe 


Leu 
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